Overview
Dataset statistics
| Number of variables | 70 |
|---|---|
| Number of observations | 724508 |
| Missing cells | 30334160 |
| Missing cells (%) | 59.8% |
| Total size in memory | 386.9 MiB |
| Average record size in memory | 560.0 B |
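The percentage and per-record figures in the table above follow from the raw counts. A minimal sketch of that arithmetic, assuming "Missing cells (%)" is missing cells divided by variables times observations, and that 1 MiB is 1024 × 1024 bytes (both are assumptions about how the report derives its numbers):

```python
# Figures copied from the "Dataset statistics" table.
n_variables = 70
n_observations = 724_508
missing_cells = 30_334_160
total_size_mib = 386.9

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: {avg_record_bytes:.1f} B")
```

Under these assumptions the computed values round to the 59.8% and 560.0 B shown in the table.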
Variable types
| Text | 70 |
|---|
Dataset
| Description | NMNH Paleobiology Specimen Records (USNM) 0049391-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.ws2uf3 |
Alerts
| institutionID has constant value "http://biocol.org/urn:lsid:biocol.org:col:34871" | Constant |
|---|---|
| collectionID has constant value "urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac" | Constant |
| institutionCode has constant value "USNM" | Constant |
| collectionCode has constant value "PAL" | Constant |
| datasetName has constant value "NMNH Paleobiology (USNM)" | Constant |
| basisOfRecord has constant value "FossilSpecimen" | Constant |
| verbatimCoordinateSystem has constant value "Degrees Minutes Seconds" | Constant |
| catalogNumber has 50535 (7.0%) missing values | Missing |
| recordNumber has 675939 (93.3%) missing values | Missing |
| recordedBy has 563497 (77.8%) missing values | Missing |
| preparations has 591600 (81.7%) missing values | Missing |
| associatedMedia has 637195 (87.9%) missing values | Missing |
| occurrenceRemarks has 638259 (88.1%) missing values | Missing |
| fieldNumber has 720044 (99.4%) missing values | Missing |
| eventDate has 453741 (62.6%) missing values | Missing |
| startDayOfYear has 571939 (78.9%) missing values | Missing |
| endDayOfYear has 571953 (78.9%) missing values | Missing |
| year has 453741 (62.6%) missing values | Missing |
| month has 571556 (78.9%) missing values | Missing |
| day has 593848 (82.0%) missing values | Missing |
| verbatimEventDate has 445814 (61.5%) missing values | Missing |
| locationID has 335037 (46.2%) missing values | Missing |
| higherGeography has 148417 (20.5%) missing values | Missing |
| continent has 210428 (29.0%) missing values | Missing |
| waterBody has 696851 (96.2%) missing values | Missing |
| islandGroup has 723710 (99.9%) missing values | Missing |
| island has 714401 (98.6%) missing values | Missing |
| country has 173269 (23.9%) missing values | Missing |
| stateProvince has 226462 (31.3%) missing values | Missing |
| county has 454433 (62.7%) missing values | Missing |
| locality has 560871 (77.4%) missing values | Missing |
| verbatimElevation has 724311 (> 99.9%) missing values | Missing |
| verbatimDepth has 724424 (> 99.9%) missing values | Missing |
| decimalLatitude has 620569 (85.7%) missing values | Missing |
| decimalLongitude has 620569 (85.7%) missing values | Missing |
| geodeticDatum has 698201 (96.4%) missing values | Missing |
| verbatimLatitude has 724503 (> 99.9%) missing values | Missing |
| verbatimLongitude has 724503 (> 99.9%) missing values | Missing |
| verbatimCoordinateSystem has 654265 (90.3%) missing values | Missing |
| georeferenceProtocol has 695012 (95.9%) missing values | Missing |
| georeferenceRemarks has 724503 (> 99.9%) missing values | Missing |
| earliestEraOrLowestErathem has 220036 (30.4%) missing values | Missing |
| latestEraOrHighestErathem has 718163 (99.1%) missing values | Missing |
| earliestPeriodOrLowestSystem has 245750 (33.9%) missing values | Missing |
| latestPeriodOrHighestSystem has 718167 (99.1%) missing values | Missing |
| earliestEpochOrLowestSeries has 376914 (52.0%) missing values | Missing |
| latestEpochOrHighestSeries has 718290 (99.1%) missing values | Missing |
| earliestAgeOrLowestStage has 562472 (77.6%) missing values | Missing |
| latestAgeOrHighestStage has 722133 (99.7%) missing values | Missing |
| group has 633218 (87.4%) missing values | Missing |
| formation has 365706 (50.5%) missing values | Missing |
| member has 643191 (88.8%) missing values | Missing |
| typeStatus has 581882 (80.3%) missing values | Missing |
| identifiedBy has 521981 (72.0%) missing values | Missing |
| scientificName has 171332 (23.6%) missing values | Missing |
| higherClassification has 172643 (23.8%) missing values | Missing |
| kingdom has 172847 (23.9%) missing values | Missing |
| phylum has 211856 (29.2%) missing values | Missing |
| class has 235611 (32.5%) missing values | Missing |
| order has 400004 (55.2%) missing values | Missing |
| family has 409455 (56.5%) missing values | Missing |
| genus has 197061 (27.2%) missing values | Missing |
| subgenus has 702202 (96.9%) missing values | Missing |
| specificEpithet has 197674 (27.3%) missing values | Missing |
| infraspecificEpithet has 708037 (97.7%) missing values | Missing |
| taxonRank has 707802 (97.7%) missing values | Missing |
| scientificNameAuthorship has 325030 (44.9%) missing values | Missing |
| gbifID has unique values | Unique |
| occurrenceID has unique values | Unique |
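The "missing values" lines above can be reproduced with a per-column count of empty fields. The following is a minimal sketch using only the standard library. The tab-separated layout, the three-row sample (which pairs identifiers and catalog numbers from the samples in this report purely for illustration), and the rule that an empty or whitespace-only field counts as missing are all assumptions, not details confirmed by the report.

```python
import csv
import io

# Hypothetical three-row sample in tab-separated form; the real download is far larger.
SAMPLE_TSV = (
    "gbifID\tcatalogNumber\tislandGroup\n"
    "1316557253\tUSNM SD38013 0000\t\n"
    "2235727162\tUSNM PAL706968\t\n"
    "1316557263\t\t\n"
)

def missing_summary(handle):
    """Return {column: (missing_count, missing_percent)}, treating empty fields as missing."""
    reader = csv.DictReader(handle, delimiter="\t")
    counts = {name: 0 for name in reader.fieldnames}
    rows = 0
    for row in reader:
        rows += 1
        for name in counts:
            # Assumption: an empty or whitespace-only field is a missing value.
            if not (row[name] or "").strip():
                counts[name] += 1
    return {name: (n, 100 * n / rows if rows else 0.0) for name, n in counts.items()}

summary = missing_summary(io.StringIO(SAMPLE_TSV))
for name, (n, pct) in summary.items():
    print(f"{name} has {n} ({pct:.1f}%) missing values")
```

To apply the same function to a real file, pass an open text handle instead of the `io.StringIO` wrapper.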
Reproduction
| Analysis started | 2025-01-14 16:33:33.665307 |
|---|---|
| Analysis finished | 2025-01-14 16:33:50.709019 |
| Duration | 17.04 seconds |
| Software version | ydata-profiling v4.12.1 |
| Download configuration | config.json |
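The reported duration can be checked against the two timestamps in the table above. A short sketch, assuming both timestamps are in the same time zone:

```python
from datetime import datetime

# Timestamp layout used in the "Reproduction" table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2025-01-14 16:33:33.665307", FMT)
finished = datetime.strptime("2025-01-14 16:33:50.709019", FMT)

# Elapsed wall-clock time in seconds.
duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```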
Variables
gbifID
Text
Unique 
| Distinct | 724508 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 724508 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1316557253 |
|---|---|
| 2nd row | 2235727162 |
| 3rd row | 1316557263 |
| 4th row | 1316557258 |
| 5th row | 1316557269 |
| Value | Count | Frequency (%) |
| 1316557253 | 1 | < 0.1% |
| 1316557860 | 1 | < 0.1% |
| 1316557419 | 1 | < 0.1% |
| 1316557667 | 1 | < 0.1% |
| 1316557340 | 1 | < 0.1% |
| 1316557263 | 1 | < 0.1% |
| 1316557258 | 1 | < 0.1% |
| 1316557269 | 1 | < 0.1% |
| 1316557294 | 1 | < 0.1% |
| 3311036301 | 1 | < 0.1% |
| Other values (724498) | 724498 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1858630 | 25.7% |
| 3 | 1114337 | 15.4% |
| 6 | 924334 | 12.8% |
| 7 | 682226 | 9.4% |
| 0 | 507951 | 7.0% |
| 8 | 482636 | 6.7% |
| 9 | 467327 | 6.5% |
| 5 | 426943 | 5.9% |
| 2 | 401616 | 5.5% |
| 4 | 379080 | 5.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7245080 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1858630 | 25.7% |
| 3 | 1114337 | 15.4% |
| 6 | 924334 | 12.8% |
| 7 | 682226 | 9.4% |
| 0 | 507951 | 7.0% |
| 8 | 482636 | 6.7% |
| 9 | 467327 | 6.5% |
| 5 | 426943 | 5.9% |
| 2 | 401616 | 5.5% |
| 4 | 379080 | 5.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7245080 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1858630 | 25.7% |
| 3 | 1114337 | 15.4% |
| 6 | 924334 | 12.8% |
| 7 | 682226 | 9.4% |
| 0 | 507951 | 7.0% |
| 8 | 482636 | 6.7% |
| 9 | 467327 | 6.5% |
| 5 | 426943 | 5.9% |
| 2 | 401616 | 5.5% |
| 4 | 379080 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7245080 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1858630 | 25.7% |
| 3 | 1114337 | 15.4% |
| 6 | 924334 | 12.8% |
| 7 | 682226 | 9.4% |
| 0 | 507951 | 7.0% |
| 8 | 482636 | 6.7% |
| 9 | 467327 | 6.5% |
| 5 | 426943 | 5.9% |
| 2 | 401616 | 5.5% |
| 4 | 379080 | 5.2% |
modified
Text
| Distinct | 6008 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 1783 |
|---|---|
| Unique (%) | 0.2% |
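"Distinct" and "Unique" are different counts, which is why this variable has 6008 distinct values but only 1783 unique ones. The sketch below assumes the usual reading: distinct is the number of different values, and unique is the number of values that occur exactly once. The input list reuses the five sample rows from this report plus one hypothetical repeat.

```python
from collections import Counter

def distinct_and_unique(values):
    """Return (distinct, unique): different values vs. values seen exactly once."""
    counts = Counter(values)
    distinct = len(counts)
    unique = sum(1 for c in counts.values() if c == 1)
    return distinct, unique

# Five sample rows, plus one repeated timestamp added for illustration.
sample = [
    "2014-11-25 18:32:00",
    "2024-10-17 09:58:00",
    "2024-10-17 10:44:00",
    "2024-08-03 21:41:00",
    "2024-10-17 10:17:00",
    "2024-10-17 09:58:00",
]

distinct, unique = distinct_and_unique(sample)
print(distinct, unique)
```

A constant column therefore has one distinct value and zero unique values, as in the institutionID section.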
Sample
| 1st row | 2014-11-25 18:32:00 |
|---|---|
| 2nd row | 2024-10-17 09:58:00 |
| 3rd row | 2024-10-17 10:44:00 |
| 4th row | 2024-08-03 21:41:00 |
| 5th row | 2024-10-17 10:17:00 |
| Value | Count | Frequency (%) |
| 2024-10-17 | 379839 | |
| 2024-08-03 | 110663 | 7.6% |
| 2014-12-01 | 62342 | 4.3% |
| 2014-11-25 | 62169 | 4.3% |
| 2024-11-18 | 18663 | 1.3% |
| 2014-11-26 | 16425 | 1.1% |
| 2022-07-29 | 12130 | 0.8% |
| 22:06:00 | 11127 | 0.8% |
| 11:08:00 | 10895 | 0.8% |
| 22:09:00 | 9244 | 0.6% |
| Other values (1703) | 755519 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 856419 | 6.2% |
| (space) | 724508 | 5.3% |
| 7 | 523431 | 3.8% |
| 3 | 323301 | 2.3% |
| 8 | 267407 | 1.9% |
| Other values (3) | 535140 | 3.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 10143112 | |
| Dash Punctuation | 1449016 | 10.5% |
| Other Punctuation | 1449016 | 10.5% |
| Space Separator | 724508 | 5.3% |
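The category names in the table above match Unicode general categories. A minimal sketch of how such counts can be obtained with Python's `unicodedata` module follows. The label mapping is written out by hand for the four categories seen here, and whether the report computes its counts this way is an assumption.

```python
import unicodedata
from collections import Counter

# Short Unicode general-category codes mapped to the labels used in the table.
LABELS = {
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Zs": "Space Separator",
}

def category_counts(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[LABELS.get(code, code)] += 1
    return counts

# One timestamp in the 19-character layout shown in the sample rows.
counts = category_counts(["2014-11-25 18:32:00"])
for label, n in counts.most_common():
    print(label, n)
```

Each value in this column has the same layout, so the per-value counts scale by the 724508 observations: 14 digits per value gives the 10143112 decimal numbers in the table.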
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| 4 | 856419 | 8.4% |
| 7 | 523431 | 5.2% |
| 3 | 323301 | 3.2% |
| 8 | 267407 | 2.6% |
| 5 | 251997 | 2.5% |
| 9 | 156334 | 1.5% |
| 6 | 126809 | 1.3% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1449016 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1449016 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 13765652 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 856419 | 6.2% |
| (space) | 724508 | 5.3% |
| 7 | 523431 | 3.8% |
| 3 | 323301 | 2.3% |
| 8 | 267407 | 1.9% |
| Other values (3) | 535140 | 3.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13765652 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 3567224 | |
| 1 | 2229486 | |
| 2 | 1840704 | |
| - | 1449016 | |
| : | 1449016 | |
| 4 | 856419 | 6.2% |
| (space) | 724508 | 5.3% |
| 7 | 523431 | 3.8% |
| 3 | 323301 | 2.3% |
| 8 | 267407 | 1.9% |
| Other values (3) | 535140 | 3.9% |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 47 |
|---|---|
| Median length | 47 |
| Mean length | 47 |
| Min length | 47 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
|---|---|
| 2nd row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| 3rd row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| 4th row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| 5th row | http://biocol.org/urn:lsid:biocol.org:col:34871 |
| Value | Count | Frequency (%) |
| http://biocol.org/urn:lsid:biocol.org:col:34871 | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 5071556 | |
| : | 3622540 | 10.6% |
| l | 2898032 | 8.5% |
| r | 2173524 | 6.4% |
| / | 2173524 | 6.4% |
| i | 2173524 | 6.4% |
| c | 2173524 | 6.4% |
| b | 1449016 | 4.3% |
| . | 1449016 | 4.3% |
| t | 1449016 | 4.3% |
| Other values (12) | 9418604 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 23184256 | |
| Other Punctuation | 7245080 | 21.3% |
| Decimal Number | 3622540 | 10.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5071556 | |
| l | 2898032 | |
| r | 2173524 | |
| i | 2173524 | |
| c | 2173524 | |
| b | 1449016 | 6.2% |
| t | 1449016 | 6.2% |
| g | 1449016 | 6.2% |
| d | 724508 | 3.1% |
| h | 724508 | 3.1% |
| Other values (4) | 2898032 |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 724508 | |
| 8 | 724508 | |
| 4 | 724508 | |
| 3 | 724508 | |
| 1 | 724508 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3622540 | |
| / | 2173524 | |
| . | 1449016 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 23184256 | |
| Common | 10867620 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5071556 | |
| l | 2898032 | |
| r | 2173524 | |
| i | 2173524 | |
| c | 2173524 | |
| b | 1449016 | 6.2% |
| t | 1449016 | 6.2% |
| g | 1449016 | 6.2% |
| d | 724508 | 3.1% |
| h | 724508 | 3.1% |
| Other values (4) | 2898032 |
Common
| Value | Count | Frequency (%) |
| : | 3622540 | |
| / | 2173524 | |
| . | 1449016 | 13.3% |
| 7 | 724508 | 6.7% |
| 8 | 724508 | 6.7% |
| 4 | 724508 | 6.7% |
| 3 | 724508 | 6.7% |
| 1 | 724508 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 34051876 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 5071556 | |
| : | 3622540 | 10.6% |
| l | 2898032 | 8.5% |
| r | 2173524 | 6.4% |
| / | 2173524 | 6.4% |
| i | 2173524 | 6.4% |
| c | 2173524 | 6.4% |
| b | 1449016 | 4.3% |
| . | 1449016 | 4.3% |
| t | 1449016 | 4.3% |
| Other values (12) | 9418604 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 44 |
| Mean length | 44 |
| Min length | 44 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
|---|---|
| 2nd row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| 3rd row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| 4th row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| 5th row | urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac |
| Value | Count | Frequency (%) |
| urn:uuid:ce595e88-ceba-42c0-a3ff-cd55b694fac | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 3622540 | 11.4% |
| - | 2898032 | 9.1% |
| 5 | 2898032 | 9.1% |
| u | 2173524 | 6.8% |
| f | 2173524 | 6.8% |
| a | 2173524 | 6.8% |
| e | 2173524 | 6.8% |
| 4 | 1449016 | 4.5% |
| b | 1449016 | 4.5% |
| 8 | 1449016 | 4.5% |
| Other values (10) | 9418604 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17388192 | |
| Decimal Number | 10143112 | |
| Dash Punctuation | 2898032 | 9.1% |
| Other Punctuation | 1449016 | 4.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 3622540 | |
| u | 2173524 | |
| f | 2173524 | |
| a | 2173524 | |
| e | 2173524 | |
| b | 1449016 | 8.3% |
| d | 1449016 | 8.3% |
| r | 724508 | 4.2% |
| i | 724508 | 4.2% |
| n | 724508 | 4.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 2898032 | |
| 4 | 1449016 | |
| 8 | 1449016 | |
| 9 | 1449016 | |
| 2 | 724508 | 7.1% |
| 0 | 724508 | 7.1% |
| 3 | 724508 | 7.1% |
| 6 | 724508 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2898032 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1449016 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17388192 | |
| Common | 14490160 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| c | 3622540 | |
| u | 2173524 | |
| f | 2173524 | |
| a | 2173524 | |
| e | 2173524 | |
| b | 1449016 | 8.3% |
| d | 1449016 | 8.3% |
| r | 724508 | 4.2% |
| i | 724508 | 4.2% |
| n | 724508 | 4.2% |
Common
| Value | Count | Frequency (%) |
| - | 2898032 | |
| 5 | 2898032 | |
| 4 | 1449016 | |
| 8 | 1449016 | |
| 9 | 1449016 | |
| : | 1449016 | |
| 2 | 724508 | 5.0% |
| 0 | 724508 | 5.0% |
| 3 | 724508 | 5.0% |
| 6 | 724508 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31878352 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 3622540 | 11.4% |
| - | 2898032 | 9.1% |
| 5 | 2898032 | 9.1% |
| u | 2173524 | 6.8% |
| f | 2173524 | 6.8% |
| a | 2173524 | 6.8% |
| e | 2173524 | 6.8% |
| 4 | 1449016 | 4.5% |
| b | 1449016 | 4.5% |
| 8 | 1449016 | 4.5% |
| Other values (10) | 9418604 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | USNM |
|---|---|
| 2nd row | USNM |
| 3rd row | USNM |
| 4th row | USNM |
| 5th row | USNM |
| Value | Count | Frequency (%) |
| usnm | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2898032 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2898032 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2898032 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 724508 | |
| S | 724508 | |
| N | 724508 | |
| M | 724508 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PAL |
|---|---|
| 2nd row | PAL |
| 3rd row | PAL |
| 4th row | PAL |
| 5th row | PAL |
| Value | Count | Frequency (%) |
| pal | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2173524 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2173524 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2173524 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| P | 724508 | |
| A | 724508 | |
| L | 724508 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Paleobiology (USNM) |
|---|---|
| 2nd row | NMNH Paleobiology (USNM) |
| 3rd row | NMNH Paleobiology (USNM) |
| 4th row | NMNH Paleobiology (USNM) |
| 5th row | NMNH Paleobiology (USNM) |
| Value | Count | Frequency (%) |
| nmnh | 724508 | |
| paleobiology | 724508 | |
| usnm | 724508 |
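The value table above lists lowercase words rather than whole field values: "NMNH Paleobiology (USNM)" appears as `nmnh`, `paleobiology`, and `usnm`. The sketch below reproduces that shape under an assumed rule inferred from the tables: split on whitespace, strip surrounding punctuation, and lowercase. It is not taken from the profiler's source.

```python
import string
from collections import Counter

def word_counts(values):
    """Count lowercase whitespace-separated words, ignoring surrounding punctuation."""
    counts = Counter()
    for value in values:
        for token in value.split():
            # Assumed rule: drop punctuation at the edges only, so "c-29" stays intact.
            token = token.strip(string.punctuation).lower()
            if token:
                counts[token] += 1
    return counts

counts = word_counts(["NMNH Paleobiology (USNM)"] * 2)
for word, n in counts.items():
    print(word, n)
```

Under this rule a constant column yields one row per word, each with a count equal to the number of observations.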
Most occurring characters
| Value | Count | Frequency (%) |
| N | 2173524 | |
| o | 2173524 | |
| (space) | 1449016 | 8.3% |
| l | 1449016 | 8.3% |
| M | 1449016 | 8.3% |
| H | 724508 | 4.2% |
| P | 724508 | 4.2% |
| a | 724508 | 4.2% |
| e | 724508 | 4.2% |
| b | 724508 | 4.2% |
| Other values (7) | 5071556 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7969588 | |
| Uppercase Letter | 6520572 | |
| Space Separator | 1449016 | 8.3% |
| Open Punctuation | 724508 | 4.2% |
| Close Punctuation | 724508 | 4.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2173524 | |
| l | 1449016 | |
| a | 724508 | 9.1% |
| e | 724508 | 9.1% |
| b | 724508 | 9.1% |
| i | 724508 | 9.1% |
| g | 724508 | 9.1% |
| y | 724508 | 9.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 2173524 | |
| M | 1449016 | |
| H | 724508 | 11.1% |
| P | 724508 | 11.1% |
| U | 724508 | 11.1% |
| S | 724508 | 11.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1449016 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 724508 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 14490160 | |
| Common | 2898032 | 16.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 2173524 | |
| o | 2173524 | |
| l | 1449016 | |
| M | 1449016 | |
| H | 724508 | 5.0% |
| P | 724508 | 5.0% |
| a | 724508 | 5.0% |
| e | 724508 | 5.0% |
| b | 724508 | 5.0% |
| i | 724508 | 5.0% |
| Other values (4) | 2898032 |
Common
| Value | Count | Frequency (%) |
| (space) | 1449016 | |
| ( | 724508 | |
| ) | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17388192 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 2173524 | |
| o | 2173524 | |
| (space) | 1449016 | 8.3% |
| l | 1449016 | 8.3% |
| M | 1449016 | 8.3% |
| H | 724508 | 4.2% |
| P | 724508 | 4.2% |
| a | 724508 | 4.2% |
| e | 724508 | 4.2% |
| b | 724508 | 4.2% |
| Other values (7) | 5071556 |
basisOfRecord
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 14 |
| Min length | 14 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | FossilSpecimen |
|---|---|
| 2nd row | FossilSpecimen |
| 3rd row | FossilSpecimen |
| 4th row | FossilSpecimen |
| 5th row | FossilSpecimen |
| Value | Count | Frequency (%) |
| fossilspecimen | 724508 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 1449016 | |
| i | 1449016 | |
| e | 1449016 | |
| F | 724508 | |
| o | 724508 | |
| l | 724508 | |
| S | 724508 | |
| p | 724508 | |
| c | 724508 | |
| m | 724508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8694096 | |
| Uppercase Letter | 1449016 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 1449016 | |
| i | 1449016 | |
| e | 1449016 | |
| o | 724508 | |
| l | 724508 | |
| p | 724508 | |
| c | 724508 | |
| m | 724508 | |
| n | 724508 |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 724508 | |
| S | 724508 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10143112 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 1449016 | |
| i | 1449016 | |
| e | 1449016 | |
| F | 724508 | |
| o | 724508 | |
| l | 724508 | |
| S | 724508 | |
| p | 724508 | |
| c | 724508 | |
| m | 724508 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10143112 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 1449016 | |
| i | 1449016 | |
| e | 1449016 | |
| F | 724508 | |
| o | 724508 | |
| l | 724508 | |
| S | 724508 | |
| p | 724508 | |
| c | 724508 | |
| m | 724508 |
occurrenceID
Text
Unique 
| Distinct | 724508 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 724508 |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/300009e1e-4f3e-4240-b198-9ea1352b28b5 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/30000a59d-34e5-42b6-837d-ad1b89b6b930 |
| 3rd row | http://n2t.net/ark:/65665/3000109b9-b6d6-4ca0-8f0c-ddde53458300 |
| 4th row | http://n2t.net/ark:/65665/30001bcd8-61d5-492a-ad56-f8131f24bdaa |
| 5th row | http://n2t.net/ark:/65665/300020a6b-970f-4e44-adb4-6d605be80b0d |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/300009e1e-4f3e-4240-b198-9ea1352b28b5 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3004266bd-f222-4227-9817-5905ac4cbc57 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30011b937-0eb9-4c75-bea7-c27393598b76 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3002cb891-3b1b-49d8-84ee-8558aba9bf13 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3000a6387-0469-4278-8ac0-fb0ac6fd37d6 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3000109b9-b6d6-4ca0-8f0c-ddde53458300 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/30001bcd8-61d5-492a-ad56-f8131f24bdaa | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300020a6b-970f-4e44-adb4-6d605be80b0d | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300045523-2307-4a34-b888-fb51510870ad | 1 | < 0.1% |
| http://n2t.net/ark:/65665/300045db2-681e-481a-836e-3643bf3debbf | 1 | < 0.1% |
| Other values (724498) | 724498 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 3622540 | 7.9% |
| 6 | 3531516 | 7.7% |
| - | 2898032 | 6.3% |
| t | 2898032 | 6.3% |
| 5 | 2808306 | 6.2% |
| a | 2263386 | 5.0% |
| e | 2084462 | 4.6% |
| 2 | 2083197 | 4.6% |
| 3 | 2083153 | 4.6% |
| 4 | 2081137 | 4.6% |
| Other values (16) | 19290243 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 19743301 | |
| Lowercase Letter | 17206607 | |
| Other Punctuation | 5796064 | 12.7% |
| Dash Punctuation | 2898032 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2898032 | |
| a | 2263386 | |
| e | 2084462 | |
| b | 1539404 | |
| n | 1449016 | |
| c | 1358538 | |
| d | 1358025 | |
| f | 1357712 | |
| k | 724508 | 4.2% |
| r | 724508 | 4.2% |
| Other values (2) | 1449016 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 3531516 | |
| 5 | 2808306 | |
| 2 | 2083197 | |
| 3 | 2083153 | |
| 4 | 2081137 | |
| 8 | 1539173 | |
| 9 | 1539102 | |
| 0 | 1359375 | 6.9% |
| 7 | 1359374 | 6.9% |
| 1 | 1358968 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 3622540 | |
| : | 1449016 | 25.0% |
| . | 724508 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2898032 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 28437397 | |
| Latin | 17206607 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 3622540 | |
| 6 | 3531516 | |
| - | 2898032 | |
| 5 | 2808306 | |
| 2 | 2083197 | |
| 3 | 2083153 | |
| 4 | 2081137 | |
| 8 | 1539173 | 5.4% |
| 9 | 1539102 | 5.4% |
| : | 1449016 | 5.1% |
| Other values (4) | 4802225 |
Latin
| Value | Count | Frequency (%) |
| t | 2898032 | |
| a | 2263386 | |
| e | 2084462 | |
| b | 1539404 | |
| n | 1449016 | |
| c | 1358538 | |
| d | 1358025 | |
| f | 1357712 | |
| k | 724508 | 4.2% |
| r | 724508 | 4.2% |
| Other values (2) | 1449016 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45644004 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 3622540 | 7.9% |
| 6 | 3531516 | 7.7% |
| - | 2898032 | 6.3% |
| t | 2898032 | 6.3% |
| 5 | 2808306 | 6.2% |
| a | 2263386 | 5.0% |
| e | 2084462 | 4.6% |
| 2 | 2083197 | 4.6% |
| 3 | 2083153 | 4.6% |
| 4 | 2081137 | 4.6% |
| Other values (16) | 19290243 |
catalogNumber
Text
Missing 
| Distinct | 655081 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 50535 |
| Missing (%) | 7.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 14 |
| Mean length | 13.86868317 |
| Min length | 7 |
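The length statistics above summarize the character length of each non-missing value. A small sketch using the five sample catalog numbers from this report (the full column gives different figures, since these five rows are only an illustration):

```python
from statistics import mean, median

# The five sample rows listed for catalogNumber.
samples = [
    "USNM SD38013 0000",
    "USNM PAL706968",
    "USNM PAL248638",
    "USNM PAL456768",
    "USNM PAL297724",
]

# Character length of each value.
lengths = [len(s) for s in samples]

summary = {
    "max": max(lengths),
    "median": median(lengths),
    "mean": mean(lengths),
    "min": min(lengths),
}
print(summary)
```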
Unique
| Unique | 638257 |
|---|---|
| Unique (%) | 94.7% |
Sample
| 1st row | USNM SD38013 0000 |
|---|---|
| 2nd row | USNM PAL706968 |
| 3rd row | USNM PAL248638 |
| 4th row | USNM PAL456768 |
| 5th row | USNM PAL297724 |
| Value | Count | Frequency (%) |
| usnm | 673973 | |
| 0000 | 59177 | 4.2% |
| 0002 | 159 | < 0.1% |
| 0001 | 159 | < 0.1% |
| 0003 | 149 | < 0.1% |
| 0004 | 145 | < 0.1% |
| 0005 | 137 | < 0.1% |
| 0006 | 116 | < 0.1% |
| 0007 | 113 | < 0.1% |
| 0008 | 105 | < 0.1% |
| Other values (652937) | 674632 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 742844 | 7.9% |
| (space) | 734892 | 7.9% |
| M | 712585 | 7.6% |
| N | 674519 | 7.2% |
| U | 674214 | 7.2% |
| 0 | 557394 | 6.0% |
| P | 521957 | 5.6% |
| A | 511374 | 5.5% |
| L | 497601 | 5.3% |
| 1 | 444334 | 4.8% |
| Other values (58) | 3275404 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 4546936 | |
| Decimal Number | 4063828 | |
| Space Separator | 734892 | 7.9% |
| Other Punctuation | 741 | < 0.1% |
| Lowercase Letter | 690 | < 0.1% |
| Dash Punctuation | 30 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 742844 | |
| M | 712585 | |
| N | 674519 | |
| U | 674214 | |
| P | 521957 | |
| A | 511374 | |
| L | 497601 | |
| D | 65264 | 1.4% |
| C | 43992 | 1.0% |
| O | 38427 | 0.8% |
| Other values (16) | 64159 | 1.4% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 130 | |
| b | 126 | |
| d | 61 | |
| e | 54 | |
| c | 50 | 7.2% |
| o | 38 | 5.5% |
| l | 31 | 4.5% |
| f | 27 | 3.9% |
| r | 26 | 3.8% |
| k | 23 | 3.3% |
| Other values (16) | 124 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 557394 | |
| 1 | 444334 | |
| 3 | 432709 | |
| 5 | 423320 | |
| 2 | 419515 | |
| 4 | 412173 | |
| 6 | 395612 | |
| 7 | 350867 | |
| 8 | 318934 | |
| 9 | 308970 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 704 | |
| " | 35 | 4.7% |
| , | 2 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 734892 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 30 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4799492 | |
| Latin | 4547626 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 742844 | |
| M | 712585 | |
| N | 674519 | |
| U | 674214 | |
| P | 521957 | |
| A | 511374 | |
| L | 497601 | |
| D | 65264 | 1.4% |
| C | 43992 | 1.0% |
| O | 38427 | 0.8% |
| Other values (42) | 64849 | 1.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 734892 | |
| 0 | 557394 | |
| 1 | 444334 | |
| 3 | 432709 | |
| 5 | 423320 | |
| 2 | 419515 | |
| 4 | 412173 | |
| 6 | 395612 | |
| 7 | 350867 | |
| 8 | 318934 | |
| Other values (6) | 309742 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9347118 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 742844 | 7.9% |
| (space) | 734892 | 7.9% |
| M | 712585 | 7.6% |
| N | 674519 | 7.2% |
| U | 674214 | 7.2% |
| 0 | 557394 | 6.0% |
| P | 521957 | 5.6% |
| A | 511374 | 5.5% |
| L | 497601 | 5.3% |
| 1 | 444334 | 4.8% |
| Other values (58) | 3275404 |
recordNumber
Text
Missing 
| Distinct | 39872 |
|---|---|
| Distinct (%) | 82.1% |
| Missing | 675939 |
| Missing (%) | 93.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 5 |
| Mean length | 6.205336737 |
| Min length | 1 |
Unique
| Unique | 37721 |
|---|---|
| Unique (%) | 77.7% |
Sample
| 1st row | PALMER LOC 1479 |
|---|---|
| 2nd row | 75432 |
| 3rd row | H-11 |
| 4th row | E73-59 |
| 5th row | Gaxin Loc 178-36 |
| Value | Count | Frequency (%) |
| loc | 1685 | 2.9% |
| emlong | 951 | 1.7% |
| urbac | 803 | 1.4% |
| olson | 263 | 0.5% |
| sample | 209 | 0.4% |
| hass | 177 | 0.3% |
| rb | 171 | 0.3% |
| c-29 | 169 | 0.3% |
| gibson | 163 | 0.3% |
| wyo | 162 | 0.3% |
| Other values (38506) | 52476 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 30021 | 10.0% |
| 5 | 27939 | 9.3% |
| 7 | 23690 | 7.9% |
| 2 | 21570 | 7.2% |
| 3 | 20657 | 6.9% |
| 6 | 18998 | 6.3% |
| 8 | 18791 | 6.2% |
| 0 | 17388 | 5.8% |
| 4 | 17006 | 5.6% |
| - | 16559 | 5.5% |
| Other values (67) | 88768 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 211386 | |
| Uppercase Letter | 58763 | 19.5% |
| Dash Punctuation | 16559 | 5.5% |
| Space Separator | 8660 | 2.9% |
| Other Punctuation | 3199 | 1.1% |
| Lowercase Letter | 2471 | 0.8% |
| Math Symbol | 145 | < 0.1% |
| Close Punctuation | 102 | < 0.1% |
| Open Punctuation | 101 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 5593 | 9.5% |
| E | 4986 | 8.5% |
| L | 4981 | 8.5% |
| C | 4891 | 8.3% |
| S | 4262 | 7.3% |
| A | 4151 | 7.1% |
| M | 3190 | 5.4% |
| R | 3078 | 5.2% |
| N | 3020 | 5.1% |
| B | 2373 | 4.0% |
| Other values (16) | 18238 |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 425 | |
| n | 315 | |
| a | 217 | |
| y | 190 | |
| l | 189 | |
| c | 189 | |
| e | 172 | |
| i | 169 | 6.8% |
| r | 167 | 6.8% |
| t | 82 | 3.3% |
| Other values (14) | 356 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 30021 | |
| 5 | 27939 | |
| 7 | 23690 | |
| 2 | 21570 | |
| 3 | 20657 | |
| 6 | 18998 | |
| 8 | 18791 | |
| 0 | 17388 | |
| 4 | 17006 | |
| 9 | 15326 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 1630 | |
| . | 955 | |
| , | 516 | 16.1% |
| ? | 56 | 1.8% |
| ' | 22 | 0.7% |
| ; | 12 | 0.4% |
| # | 5 | 0.2% |
| : | 2 | 0.1% |
| & | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 135 | |
| = | 10 | 6.9% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 100 | |
| } | 2 | 2.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16559 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8660 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 101 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 240153 | 79.7% |
| Latin | 61234 | 20.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 5593 | 9.1% |
| E | 4986 | 8.1% |
| L | 4981 | 8.1% |
| C | 4891 | 8.0% |
| S | 4262 | 7.0% |
| A | 4151 | 6.8% |
| M | 3190 | 5.2% |
| R | 3078 | 5.0% |
| N | 3020 | 4.9% |
| B | 2373 | 3.9% |
| Other values (40) | 20709 |
Common
| Value | Count | Frequency (%) |
| 1 | 30021 | |
| 5 | 27939 | |
| 7 | 23690 | |
| 2 | 21570 | |
| 3 | 20657 | |
| 6 | 18998 | |
| 8 | 18791 | |
| 0 | 17388 | |
| 4 | 17006 | |
| - | 16559 | |
| Other values (17) | 27534 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 301387 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 30021 | 10.0% |
| 5 | 27939 | 9.3% |
| 7 | 23690 | 7.9% |
| 2 | 21570 | 7.2% |
| 3 | 20657 | 6.9% |
| 6 | 18998 | 6.3% |
| 8 | 18791 | 6.2% |
| 0 | 17388 | 5.8% |
| 4 | 17006 | 5.6% |
| - | 16559 | 5.5% |
| Other values (67) | 88768 |
recordedBy
Text
Missing 
| Distinct | 3957 |
|---|---|
| Distinct (%) | 2.5% |
| Missing | 563497 |
| Missing (%) | 77.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 119 |
|---|---|
| Median length | 61 |
| Mean length | 10.93147052 |
| Min length | 1 |
Unique
| Unique | 1329 |
|---|---|
| Unique (%) | 0.8% |
Sample
| 1st row | R. Snow |
|---|---|
| 2nd row | D. Palmer |
| 3rd row | W. Woodring & L. Lupher |
| 4th row | James |
| 5th row | Ross |
| Value | Count | Frequency (%) |
| (empty) | 21228 | 6.1% |
| j | 19727 | 5.7% |
| r | 15376 | 4.5% |
| w | 14249 | 4.1% |
| a | 12060 | 3.5% |
| james | 11468 | 3.3% |
| l | 10757 | 3.1% |
| woodring | 9356 | 2.7% |
| pribyl | 8943 | 2.6% |
| c | 7362 | 2.1% |
| Other values (2560) | 214833 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 184348 | 10.5% |
| e | 133592 | 7.6% |
| . | 131492 | 7.5% |
| r | 102132 | 5.8% |
| o | 91217 | 5.2% |
| l | 89319 | 5.1% |
| n | 89079 | 5.1% |
| a | 84651 | 4.8% |
| i | 80231 | 4.6% |
| s | 70452 | 4.0% |
| Other values (51) | 703574 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1075097 | |
| Uppercase Letter | 337569 | 19.2% |
| Space Separator | 184348 | 10.5% |
| Other Punctuation | 160539 | 9.1% |
| Dash Punctuation | 2462 | 0.1% |
| Open Punctuation | 36 | < 0.1% |
| Close Punctuation | 36 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 133592 | |
| r | 102132 | |
| o | 91217 | 8.5% |
| l | 89319 | 8.3% |
| n | 89079 | 8.3% |
| a | 84651 | 7.9% |
| i | 80231 | 7.5% |
| s | 70452 | 6.6% |
| t | 48464 | 4.5% |
| d | 48173 | 4.5% |
| Other values (18) | 237787 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 36000 | 10.7% |
| W | 33626 | 10.0% |
| A | 27177 | 8.1% |
| R | 24357 | 7.2% |
| P | 20822 | 6.2% |
| C | 20595 | 6.1% |
| M | 19813 | 5.9% |
| S | 19479 | 5.8% |
| L | 18797 | 5.6% |
| H | 15162 | 4.5% |
| Other values (15) | 101741 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 131492 | |
| & | 21228 | 13.2% |
| , | 7789 | 4.9% |
| ' | 30 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 184348 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2462 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 36 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 36 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1412666 | 80.3% |
| Common | 347421 | 19.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 133592 | 9.5% |
| r | 102132 | 7.2% |
| o | 91217 | 6.5% |
| l | 89319 | 6.3% |
| n | 89079 | 6.3% |
| a | 84651 | 6.0% |
| i | 80231 | 5.7% |
| s | 70452 | 5.0% |
| t | 48464 | 3.4% |
| d | 48173 | 3.4% |
| Other values (43) | 575356 |
Common
| Value | Count | Frequency (%) |
| (space) | 184348 | |
| . | 131492 | |
| & | 21228 | 6.1% |
| , | 7789 | 2.2% |
| - | 2462 | 0.7% |
| ( | 36 | < 0.1% |
| ) | 36 | < 0.1% |
| ' | 30 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1760046 | |
| None | 41 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 184348 | 10.5% |
| e | 133592 | 7.6% |
| . | 131492 | 7.5% |
| r | 102132 | 5.8% |
| o | 91217 | 5.2% |
| l | 89319 | 5.1% |
| n | 89079 | 5.1% |
| a | 84651 | 4.8% |
| i | 80231 | 4.6% |
| s | 70452 | 4.0% |
| Other values (49) | 703533 |
None
| Value | Count | Frequency (%) |
| ú | 40 | |
| č | 1 | 2.4% |
individualCount
Text
| Distinct | 686 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 303 |
| Missing (%) | < 0.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 1 |
| Mean length | 1.088909908 |
| Min length | 1 |
Unique
| Unique | 253 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 25 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 594864 | |
| 2 | 29629 | 4.1% |
| 3 | 14673 | 2.0% |
| 4 | 9858 | 1.4% |
| 5 | 7420 | 1.0% |
| 6 | 5780 | 0.8% |
| 7 | 4510 | 0.6% |
| 8 | 3695 | 0.5% |
| 10 | 3151 | 0.4% |
| 9 | 3129 | 0.4% |
| Other values (676) | 47496 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 624602 | |
| 2 | 43921 | 5.6% |
| 0 | 28217 | 3.6% |
| 3 | 23988 | 3.0% |
| 5 | 17293 | 2.2% |
| 4 | 17104 | 2.2% |
| 6 | 10762 | 1.4% |
| 7 | 9146 | 1.2% |
| 8 | 7494 | 1.0% |
| 9 | 6067 | 0.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 788594 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 624602 | |
| 2 | 43921 | 5.6% |
| 0 | 28217 | 3.6% |
| 3 | 23988 | 3.0% |
| 5 | 17293 | 2.2% |
| 4 | 17104 | 2.2% |
| 6 | 10762 | 1.4% |
| 7 | 9146 | 1.2% |
| 8 | 7494 | 1.0% |
| 9 | 6067 | 0.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 788594 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 624602 | |
| 2 | 43921 | 5.6% |
| 0 | 28217 | 3.6% |
| 3 | 23988 | 3.0% |
| 5 | 17293 | 2.2% |
| 4 | 17104 | 2.2% |
| 6 | 10762 | 1.4% |
| 7 | 9146 | 1.2% |
| 8 | 7494 | 1.0% |
| 9 | 6067 | 0.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 788594 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 624602 | |
| 2 | 43921 | 5.6% |
| 0 | 28217 | 3.6% |
| 3 | 23988 | 3.0% |
| 5 | 17293 | 2.2% |
| 4 | 17104 | 2.2% |
| 6 | 10762 | 1.4% |
| 7 | 9146 | 1.2% |
| 8 | 7494 | 1.0% |
| 9 | 6067 | 0.8% |
preparations
Text
Missing 
| Distinct | 381 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 591600 |
| Missing (%) | 81.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 94 |
|---|---|
| Median length | 91 |
| Mean length | 16.14684594 |
| Min length | 3 |
Unique
| Unique | 130 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Boxes and vials |
|---|---|
| 2nd row | Thin sections |
| 3rd row | Secondary microslides |
| 4th row | Wet |
| 5th row | plastic container |
| Value | Count | Frequency (%) |
| microslide | 45697 | |
| microslides | 34837 | |
| secondary | 33230 | |
| remnants | 26629 | |
| thin | 24547 | |
| sections | 24011 | |
| no | 15071 | 5.8% |
| with | 10919 | 4.2% |
| unsectioned | 9109 | 3.5% |
| bottle | 3934 | 1.5% |
| Other values (53) | 32636 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | 8.0% |
| o | 167894 | 7.8% |
| c | 147453 | 6.9% |
| r | 146905 | 6.8% |
| d | 130804 | 6.1% |
| (space) | 127712 | 6.0% |
| l | 92477 | 4.3% |
| Other values (41) | 501014 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1849130 | |
| Uppercase Letter | 159097 | 7.4% |
| Space Separator | 127712 | 6.0% |
| Other Punctuation | 10096 | 0.5% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | |
| o | 167894 | |
| c | 147453 | |
| r | 146905 | |
| d | 130804 | |
| l | 92477 | 5.0% |
| t | 85481 | 4.6% |
| Other values (14) | 246330 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 46146 | |
| S | 38065 | |
| T | 27401 | |
| U | 10261 | 6.4% |
| B | 6095 | 3.8% |
| P | 5926 | 3.7% |
| C | 5880 | 3.7% |
| O | 5094 | 3.2% |
| E | 3082 | 1.9% |
| R | 2197 | 1.4% |
| Other values (11) | 8950 | 5.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 9850 | |
| & | 157 | 1.6% |
| / | 89 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 127712 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2008227 | 93.6% |
| Common | 137818 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | |
| o | 167894 | |
| c | 147453 | 7.3% |
| r | 146905 | 7.3% |
| d | 130804 | 6.5% |
| l | 92477 | 4.6% |
| t | 85481 | 4.3% |
| Other values (35) | 405427 |
Common
| Value | Count | Frequency (%) |
| (space) | 127712 | |
| ; | 9850 | 7.1% |
| & | 157 | 0.1% |
| / | 89 | 0.1% |
| ( | 5 | < 0.1% |
| ) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2146045 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 236706 | |
| s | 211809 | |
| e | 210870 | |
| n | 172401 | 8.0% |
| o | 167894 | 7.8% |
| c | 147453 | 6.9% |
| r | 146905 | 6.8% |
| d | 130804 | 6.1% |
| (space) | 127712 | 6.0% |
| l | 92477 | 4.3% |
| Other values (41) | 501014 |
associatedMedia
Text
Missing 
| Distinct | 84848 |
|---|---|
| Distinct (%) | 97.2% |
| Missing | 637195 |
| Missing (%) | 87.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 1069 |
|---|---|
| Median length | 1059 |
| Mean length | 58.46043544 |
| Min length | 48 |
Unique
| Unique | 83728 |
|---|---|
| Unique (%) | 95.9% |
Sample
| 1st row | https://collections.nmnh.si.edu/media/?i=12688993 |
|---|---|
| 2nd row | https://collections.nmnh.si.edu/media/?i=12689748 |
| 3rd row | https://collections.nmnh.si.edu/media/?i=15308925 |
| 4th row | https://collections.nmnh.si.edu/media/?i=11098487 |
| 5th row | https://collections.nmnh.si.edu/media/?i=12770417; 12770964 |
| Value | Count | Frequency (%) |
| https://collections.nmnh.si.edu/media/?i=16189563 | 203 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=16053361 | 170 | 0.1% |
| 10035032 | 87 | 0.1% |
| https://collections.nmnh.si.edu/media/?i=13958963 | 76 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16647294 | 48 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16725276 | 37 | < 0.1% |
| https://collections.nmnh.si.edu/media/?i=16115280 | 33 | < 0.1% |
| 10320533 | 30 | < 0.1% |
| 10320530 | 29 | < 0.1% |
| 10320532 | 26 | < 0.1% |
| Other values (167678) | 170293 |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 349252 | 6.8% |
| / | 349252 | 6.8% |
| n | 261939 | 5.1% |
| s | 261939 | 5.1% |
| t | 261939 | 5.1% |
| . | 261939 | 5.1% |
| e | 261939 | 5.1% |
| 1 | 256693 | 5.0% |
| d | 174626 | 3.4% |
| m | 174626 | 3.4% |
| Other values (21) | 2490212 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2706703 | |
| Decimal Number | 1357085 | |
| Other Punctuation | 869536 | 17.0% |
| Math Symbol | 87313 | 1.7% |
| Space Separator | 83719 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 349252 | |
| n | 261939 | |
| s | 261939 | |
| t | 261939 | |
| e | 261939 | |
| d | 174626 | 6.5% |
| m | 174626 | 6.5% |
| h | 174626 | 6.5% |
| l | 174626 | 6.5% |
| o | 174626 | 6.5% |
| Other values (4) | 436565 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 256693 | |
| 2 | 156668 | |
| 8 | 152923 | |
| 0 | 142520 | |
| 7 | 132916 | |
| 4 | 117629 | |
| 6 | 106828 | |
| 3 | 103174 | |
| 9 | 97272 | 7.2% |
| 5 | 90462 | 6.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 349252 | |
| . | 261939 | |
| ? | 87313 | 10.0% |
| : | 87313 | 10.0% |
| ; | 83719 | 9.6% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 87313 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 83719 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2706703 | 53.0% |
| Common | 2397653 | 47.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 349252 | |
| . | 261939 | |
| 1 | 256693 | |
| 2 | 156668 | 6.5% |
| 8 | 152923 | 6.4% |
| 0 | 142520 | 5.9% |
| 7 | 132916 | 5.5% |
| 4 | 117629 | 4.9% |
| 6 | 106828 | 4.5% |
| 3 | 103174 | 4.3% |
| Other values (7) | 617111 |
Latin
| Value | Count | Frequency (%) |
| i | 349252 | |
| n | 261939 | |
| s | 261939 | |
| t | 261939 | |
| e | 261939 | |
| d | 174626 | 6.5% |
| m | 174626 | 6.5% |
| h | 174626 | 6.5% |
| l | 174626 | 6.5% |
| o | 174626 | 6.5% |
| Other values (4) | 436565 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5104356 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 349252 | 6.8% |
| / | 349252 | 6.8% |
| n | 261939 | 5.1% |
| s | 261939 | 5.1% |
| t | 261939 | 5.1% |
| . | 261939 | 5.1% |
| e | 261939 | 5.1% |
| 1 | 256693 | 5.0% |
| d | 174626 | 3.4% |
| m | 174626 | 3.4% |
| Other values (21) | 2490212 |
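The sample rows above suggest that each associatedMedia value holds one media URL, optionally followed by further numbers separated by `; `. A minimal Python sketch for splitting such a value follows. The helper name `media_ids` is hypothetical, and treating the trailing numbers as additional media identifiers for the same URL prefix is an assumption that the report itself does not confirm.

```python
from typing import List

# Assumed prefix, copied from the sample rows of this report.
MEDIA_PREFIX = "https://collections.nmnh.si.edu/media/?i="


def media_ids(value: str) -> List[str]:
    """Split one associatedMedia value into its identifier strings."""
    ids = []
    for part in value.split(";"):
        part = part.strip()
        # Drop the URL prefix when present, keeping only the identifier.
        if part.startswith(MEDIA_PREFIX):
            part = part[len(MEDIA_PREFIX):]
        if part:
            ids.append(part)
    return ids


print(media_ids("https://collections.nmnh.si.edu/media/?i=12770417; 12770964"))
# prints ['12770417', '12770964']
```

Keeping the identifiers as strings avoids any assumption about their numeric range or leading zeros.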
occurrenceRemarks
Text
Missing 
| Distinct | 38195 |
|---|---|
| Distinct (%) | 44.3% |
| Missing | 638259 |
| Missing (%) | 88.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 1257 |
|---|---|
| Median length | 1240 |
| Mean length | 357.4557966 |
| Min length | 5 |
Unique
| Unique | 36384 |
|---|---|
| Unique (%) | 42.2% |
Sample
| 1st row | Specimen comments: Associated w/ #0343 and #0346. \| Body size code: medium; Taphonomic Significance: Human modification \| Features: Weathering, diagenesis: N/A; Burn Color: none; Burn Modification: none; Cut: 0; Scrape: 0; Chop: 0; Loading Notch: 0; Counterblow: 0; Anvil pit: 0; Carn pit: 0; Carn score: 0; Carn furrow: 0; Carn punct: 0; Carn crenulation: 0; Rodent gnaw: none |
|---|---|
| 2nd row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Information generated by NMNH Department of Paleobiology volunteers: Specimen count and preliminary identification to class. |
| 3rd row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Information generated by NMNH Department of Paleobiology volunteers: Specimen count and preliminary identification to class. |
| 4th row | The fossil is marked with the original Green River number and is often mistaken for the USNM number. That original Green River collection number is 75432.; Numbers associated with this fossil: 578683. 75432. 40193. |
| 5th row | EMu record was created as part of the Smithsonian Institution Digitization Program Office (SI DPO) mass digitization pilot project to support the National Science Foundation Advancing Digitization of Biodiversity Collections Eastern Pacific Invertebrates of the Cenozoic Collaborative Thematic Collections Network (NSF ADBC EPICC TCN). The SI DPO mass digitization pilot workflow includes crowdsourced label transcription through the SI Transcription Center.; Additional label information: This locality is at approximately the same horizon as USGS CENO LOC 5686, in which a shale fauna was collected \| See USGS CENO LOC 5703; Verbatim Lithostratigraphy: Tejon Formation; Sandstone forming the upper member of the Tejon \| Discontinuous lenses in a soft brownish sandstone, less than 100 feet stratigraphically below the overlying diatomaceous shale; Verbatim Chronostratigraphy: Eocene |
| Value | Count | Frequency (%) |
| the | 291111 | 6.9% |
| digitization | 174338 | 4.1% |
| of | 164357 | 3.9% |
| si | 100203 | 2.4% |
| collections | 99405 | 2.4% |
| number | 86263 | 2.0% |
| is | 85833 | 2.0% |
| mass | 74949 | 1.8% |
| dpo | 74947 | 1.8% |
| with | 57325 | 1.4% |
| Other values (66970) | 3009589 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 4132071 | 13.4% |
| i | 2608470 | 8.5% |
| t | 2311910 | 7.5% |
| o | 2139574 | 6.9% |
| e | 2129723 | 6.9% |
| n | 1708168 | 5.5% |
| a | 1671073 | 5.4% |
| r | 1554155 | 5.0% |
| s | 1249854 | 4.1% |
| c | 981043 | 3.2% |
| Other values (82) | 10344164 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22179429 | |
| Space Separator | 4132071 | 13.4% |
| Uppercase Letter | 3027854 | 9.8% |
| Decimal Number | 712264 | 2.3% |
| Other Punctuation | 536260 | 1.7% |
| Open Punctuation | 103223 | 0.3% |
| Close Punctuation | 103221 | 0.3% |
| Math Symbol | 26815 | 0.1% |
| Dash Punctuation | 8726 | < 0.1% |
| Connector Punctuation | 335 | < 0.1% |
| Other values (3) | 7 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2608470 | |
| t | 2311910 | |
| o | 2139574 | |
| e | 2129723 | |
| n | 1708168 | 7.7% |
| a | 1671073 | 7.5% |
| r | 1554155 | 7.0% |
| s | 1249854 | 5.6% |
| c | 981043 | 4.4% |
| l | 809850 | 3.7% |
| Other values (16) | 5015609 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 475177 | |
| S | 312569 | |
| N | 284886 | |
| I | 260808 | |
| P | 248493 | |
| D | 239558 | |
| T | 217566 | 7.2% |
| E | 157599 | 5.2% |
| A | 134747 | 4.5% |
| O | 129263 | 4.3% |
| Other values (16) | 567188 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 253963 | |
| : | 134709 | |
| ; | 123326 | |
| , | 10668 | 2.0% |
| / | 5315 | 1.0% |
| & | 3632 | 0.7% |
| ? | 1748 | 0.3% |
| " | 1387 | 0.3% |
| # | 984 | 0.2% |
| ' | 412 | 0.1% |
| Other values (5) | 116 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 96673 | |
| 5 | 95617 | |
| 0 | 89759 | |
| 4 | 70754 | |
| 2 | 67002 | |
| 7 | 66254 | |
| 8 | 64489 | |
| 6 | 57819 | |
| 3 | 52279 | |
| 9 | 51618 |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 24725 | |
| + | 1585 | 5.9% |
| > | 212 | 0.8% |
| < | 199 | 0.7% |
| = | 94 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 103206 | |
| [ | 17 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 103204 | |
| ] | 17 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 4132071 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 8726 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 335 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 4 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 2 |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25207283 | 81.8% |
| Common | 5622922 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 2608470 | 10.3% |
| t | 2311910 | 9.2% |
| o | 2139574 | 8.5% |
| e | 2129723 | 8.4% |
| n | 1708168 | 6.8% |
| a | 1671073 | 6.6% |
| r | 1554155 | 6.2% |
| s | 1249854 | 5.0% |
| c | 981043 | 3.9% |
| l | 809850 | 3.2% |
| Other values (42) | 8043463 |
Common
| Value | Count | Frequency (%) |
| (space) | 4132071 | |
| . | 253963 | 4.5% |
| : | 134709 | 2.4% |
| ; | 123326 | 2.2% |
| ( | 103206 | 1.8% |
| ) | 103204 | 1.8% |
| 1 | 96673 | 1.7% |
| 5 | 95617 | 1.7% |
| 0 | 89759 | 1.6% |
| 4 | 70754 | 1.3% |
| Other values (30) | 419640 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30830198 | |
| Punctuation | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 4132071 | 13.4% |
| i | 2608470 | 8.5% |
| t | 2311910 | 7.5% |
| o | 2139574 | 6.9% |
| e | 2129723 | 6.9% |
| n | 1708168 | 5.5% |
| a | 1671073 | 5.4% |
| r | 1554155 | 5.0% |
| s | 1249854 | 4.1% |
| c | 981043 | 3.2% |
| Other values (79) | 10344157 |
Punctuation
| Value | Count | Frequency (%) |
| “ | 4 | |
| ” | 2 | |
| … | 1 | 14.3% |
fieldNumber
Text
Missing 
| Distinct | 1516 |
|---|---|
| Distinct (%) | 34.0% |
| Missing | 720044 |
| Missing (%) | 99.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 209 |
|---|---|
| Median length | 45 |
| Mean length | 35.25537634 |
| Min length | 1 |
Unique
| Unique | 1229 |
|---|---|
| Unique (%) | 27.5% |
Sample
| 1st row | MTC-08009; MTC-08009B; MTC-08009B (A); MTC-08009B (B) |
|---|---|
| 2nd row | 217 |
| 3rd row | YP79-2 |
| 4th row | TDP31 |
| 5th row | 82-10; 82-19; 82-21; 82-22; 82-4; 82-6; 82-7 |
| Value | Count | Frequency (%) |
| 82-10 | 767 | 4.2% |
| 82-21 | 767 | 4.2% |
| 82-22 | 767 | 4.2% |
| 82-4 | 767 | 4.2% |
| 82-6 | 767 | 4.2% |
| 82-7 | 767 | 4.2% |
| 82-19 | 767 | 4.2% |
| mtc-04028dd | 329 | 1.8% |
| mtc-04028h | 329 | 1.8% |
| mtc-04028gg | 329 | 1.8% |
| Other values (1502) | 11759 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| - | 15944 | |
| 2 | 14513 | |
| (space) | 13651 | 8.7% |
| ; | 12694 | 8.1% |
| 8 | 11928 | 7.6% |
| C | 9870 | 6.3% |
| M | 9201 | 5.8% |
| T | 8674 | 5.5% |
| 4 | 7381 | 4.7% |
| Other values (62) | 34692 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 72021 | |
| Uppercase Letter | 40992 | |
| Dash Punctuation | 15944 | 10.1% |
| Space Separator | 13651 | 8.7% |
| Other Punctuation | 12856 | 8.2% |
| Lowercase Letter | 1716 | 1.1% |
| Close Punctuation | 100 | 0.1% |
| Open Punctuation | 100 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 290 | |
| a | 205 | |
| m | 201 | |
| e | 185 | |
| l | 159 | |
| p | 150 | |
| o | 130 | |
| t | 77 | 4.5% |
| r | 70 | 4.1% |
| i | 55 | 3.2% |
| Other values (16) | 194 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 9870 | |
| M | 9201 | |
| T | 8674 | |
| A | 1535 | 3.7% |
| G | 1513 | 3.7% |
| B | 1509 | 3.7% |
| E | 1291 | 3.1% |
| D | 1285 | 3.1% |
| F | 1161 | 2.8% |
| H | 1137 | 2.8% |
| Other values (15) | 3816 | 9.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| 2 | 14513 | |
| 8 | 11928 | |
| 4 | 7381 | 10.2% |
| 1 | 6730 | 9.3% |
| 3 | 3699 | 5.1% |
| 5 | 3595 | 5.0% |
| 7 | 2000 | 2.8% |
| 9 | 1780 | 2.5% |
| 6 | 1563 | 2.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 12694 | |
| . | 62 | 0.5% |
| , | 49 | 0.4% |
| # | 34 | 0.3% |
| / | 10 | 0.1% |
| & | 4 | < 0.1% |
| ' | 3 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 15944 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 13651 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 100 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 100 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 114672 | 72.9% |
| Latin | 42708 | 27.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 9870 | |
| M | 9201 | |
| T | 8674 | |
| A | 1535 | 3.6% |
| G | 1513 | 3.5% |
| B | 1509 | 3.5% |
| E | 1291 | 3.0% |
| D | 1285 | 3.0% |
| F | 1161 | 2.7% |
| H | 1137 | 2.7% |
| Other values (41) | 5532 |
Common
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| - | 15944 | |
| 2 | 14513 | |
| (space) | 13651 | |
| ; | 12694 | |
| 8 | 11928 | |
| 4 | 7381 | 6.4% |
| 1 | 6730 | 5.9% |
| 3 | 3699 | 3.2% |
| 5 | 3595 | 3.1% |
| Other values (11) | 5705 | 5.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 157380 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 18832 | |
| - | 15944 | |
| 2 | 14513 | |
| (space) | 13651 | 8.7% |
| ; | 12694 | 8.1% |
| 8 | 11928 | 7.6% |
| C | 9870 | 6.3% |
| M | 9201 | 5.8% |
| T | 8674 | 5.5% |
| 4 | 7381 | 4.7% |
| Other values (62) | 34692 |
eventDate
Text
Missing 
| Distinct | 17617 |
|---|---|
| Distinct (%) | 6.5% |
| Missing | 453741 |
| Missing (%) | 62.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 7.649425521 |
| Min length | 4 |
Unique
| Unique | 5897 |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | 1985-01-23 |
|---|---|
| 2nd row | 1974 |
| 3rd row | 1980 |
| 4th row | 1963 |
| 5th row | 1956 |
| Value | Count | Frequency (%) |
| 1910/1917 | 6616 | 2.4% |
| 1991/1993 | 6310 | 2.3% |
| 1999 | 3773 | 1.4% |
| 1980 | 3739 | 1.4% |
| 1982 | 3572 | 1.3% |
| 1984-02 | 3350 | 1.2% |
| 1998 | 3319 | 1.2% |
| 1997 | 3308 | 1.2% |
| 1995 | 3121 | 1.2% |
| 2001 | 2926 | 1.1% |
| Other values (17607) | 230733 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 451090 | |
| 9 | 375304 | |
| - | 289583 | |
| 0 | 255834 | |
| 8 | 133815 | 6.5% |
| 7 | 127284 | 6.1% |
| 2 | 109700 | 5.3% |
| 6 | 89305 | 4.3% |
| 3 | 74141 | 3.6% |
| 4 | 71285 | 3.4% |
| Other values (3) | 93871 | 4.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1757750 | |
| Dash Punctuation | 289583 | 14.0% |
| Other Punctuation | 23879 | 1.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 451090 | |
| 9 | 375304 | |
| 0 | 255834 | |
| 8 | 133815 | 7.6% |
| 7 | 127284 | 7.2% |
| 2 | 109700 | 6.2% |
| 6 | 89305 | 5.1% |
| 3 | 74141 | 4.2% |
| 4 | 71285 | 4.1% |
| 5 | 69992 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 23877 | |
| , | 2 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 289583 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2071212 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 451090 | |
| 9 | 375304 | |
| - | 289583 | |
| 0 | 255834 | |
| 8 | 133815 | 6.5% |
| 7 | 127284 | 6.1% |
| 2 | 109700 | 5.3% |
| 6 | 89305 | 4.3% |
| 3 | 74141 | 3.6% |
| 4 | 71285 | 3.4% |
| Other values (3) | 93871 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2071212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 451090 | |
| 9 | 375304 | |
| - | 289583 | |
| 0 | 255834 | |
| 8 | 133815 | 6.5% |
| 7 | 127284 | 6.1% |
| 2 | 109700 | 5.3% |
| 6 | 89305 | 4.3% |
| 3 | 74141 | 3.6% |
| 4 | 71285 | 3.4% |
| Other values (3) | 93871 | 4.5% |
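The eventDate values above mix several shapes: a bare year (`1974`), a year and month (`1984-02`), a full date (`1985-01-23`), and a `/`-separated interval (`1910/1917`). The following Python sketch shows one way such strings could be split into start and end components. The function names are hypothetical, the sketch assumes only these shapes occur, and it does not validate month or day ranges; values in any other shape (for example the two containing a comma) yield `None`.

```python
import re
from typing import Optional, Tuple

DatePart = Tuple[int, Optional[int], Optional[int]]  # (year, month, day)

# YYYY, YYYY-MM, or YYYY-MM-DD; month and day are optional.
_PART = re.compile(r"^(\d{4})(?:-(\d{2}))?(?:-(\d{2}))?$")


def _parse_part(text: str) -> Optional[DatePart]:
    match = _PART.match(text.strip())
    if match is None:
        return None
    year, month, day = match.groups()
    return (int(year), int(month) if month else None, int(day) if day else None)


def parse_event_date(value: str) -> Optional[Tuple[DatePart, DatePart]]:
    """Return (start, end); end equals start when the value is not an interval."""
    parts = value.split("/")
    if len(parts) > 2:
        return None
    parsed = [_parse_part(p) for p in parts]
    if any(p is None for p in parsed):
        return None
    return parsed[0], parsed[-1]


print(parse_event_date("1910/1917"))
# prints ((1910, None, None), (1917, None, None))
print(parse_event_date("1984-02"))
# prints ((1984, 2, None), (1984, 2, None))
```

Returning `None` for missing month or day keeps the original precision visible instead of inventing a default such as the first of the month.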
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 571939 |
| Missing (%) | 78.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.836395336 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 23 |
|---|---|
| 2nd row | 267 |
| 3rd row | 230 |
| 4th row | 288 |
| 5th row | 100 |
| Value | Count | Frequency (%) |
| 60 | 3645 | 2.4% |
| 212 | 3066 | 2.0% |
| 243 | 2888 | 1.9% |
| 181 | 2290 | 1.5% |
| 151 | 2068 | 1.4% |
| 304 | 1900 | 1.2% |
| 213 | 1765 | 1.2% |
| 120 | 1640 | 1.1% |
| 273 | 1383 | 0.9% |
| 244 | 1217 | 0.8% |
| Other values (356) | 130707 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 95911 | |
| 1 | 86225 | |
| 3 | 48550 | |
| 0 | 34306 | 7.9% |
| 4 | 30194 | 7.0% |
| 9 | 29540 | 6.8% |
| 6 | 28135 | 6.5% |
| 5 | 27414 | 6.3% |
| 8 | 26265 | 6.1% |
| 7 | 26206 | 6.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 432746 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 95911 | |
| 1 | 86225 | |
| 3 | 48550 | |
| 0 | 34306 | 7.9% |
| 4 | 30194 | 7.0% |
| 9 | 29540 | 6.8% |
| 6 | 28135 | 6.5% |
| 5 | 27414 | 6.3% |
| 8 | 26265 | 6.1% |
| 7 | 26206 | 6.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 432746 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 95911 | |
| 1 | 86225 | |
| 3 | 48550 | |
| 0 | 34306 | 7.9% |
| 4 | 30194 | 7.0% |
| 9 | 29540 | 6.8% |
| 6 | 28135 | 6.5% |
| 5 | 27414 | 6.3% |
| 8 | 26265 | 6.1% |
| 7 | 26206 | 6.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 432746 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 95911 | |
| 1 | 86225 | |
| 3 | 48550 | |
| 0 | 34306 | 7.9% |
| 4 | 30194 | 7.0% |
| 9 | 29540 | 6.8% |
| 6 | 28135 | 6.5% |
| 5 | 27414 | 6.3% |
| 8 | 26265 | 6.1% |
| 7 | 26206 | 6.1% |
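In Darwin Core, startDayOfYear is the ordinal day of the year on which the event began (1 for January 1, 365 for December 31, or 366 in a leap year), which is consistent with the 366 distinct values reported above. As a small illustration, the Python sketch below derives that number from a calendar date using only the standard library; the helper name `day_of_year` is hypothetical.

```python
from datetime import date


def day_of_year(year: int, month: int, day: int) -> int:
    """Ordinal day within the year: 1 for January 1, up to 365 or 366."""
    return date(year, month, day).timetuple().tm_yday


print(day_of_year(1985, 1, 23))   # prints 23
print(day_of_year(1984, 12, 31))  # prints 366 (1984 is a leap year)
```

The first call corresponds to the sample eventDate `1985-01-23` and the sample startDayOfYear `23` shown in this report.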
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 571953 |
| Missing (%) | 78.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.837606109 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 23 |
|---|---|
| 2nd row | 267 |
| 3rd row | 230 |
| 4th row | 288 |
| 5th row | 100 |
| Value | Count | Frequency (%) |
| 60 | 3687 | 2.4% |
| 243 | 3058 | 2.0% |
| 212 | 2958 | 1.9% |
| 151 | 2041 | 1.3% |
| 181 | 2016 | 1.3% |
| 304 | 1825 | 1.2% |
| 120 | 1813 | 1.2% |
| 213 | 1760 | 1.2% |
| 273 | 1430 | 0.9% |
| 244 | 1424 | 0.9% |
| Other values (356) | 130543 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 96077 | |
| 1 | 85473 | |
| 3 | 48226 | |
| 0 | 34296 | 7.9% |
| 4 | 30948 | 7.1% |
| 9 | 29109 | 6.7% |
| 6 | 28569 | 6.6% |
| 5 | 27645 | 6.4% |
| 7 | 26568 | 6.1% |
| 8 | 25980 | 6.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 432891 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 96077 | |
| 1 | 85473 | |
| 3 | 48226 | |
| 0 | 34296 | 7.9% |
| 4 | 30948 | 7.1% |
| 9 | 29109 | 6.7% |
| 6 | 28569 | 6.6% |
| 5 | 27645 | 6.4% |
| 7 | 26568 | 6.1% |
| 8 | 25980 | 6.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 432891 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 96077 | |
| 1 | 85473 | |
| 3 | 48226 | |
| 0 | 34296 | 7.9% |
| 4 | 30948 | 7.1% |
| 9 | 29109 | 6.7% |
| 6 | 28569 | 6.6% |
| 5 | 27645 | 6.4% |
| 7 | 26568 | 6.1% |
| 8 | 25980 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 432891 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 96077 | |
| 1 | 85473 | |
| 3 | 48226 | |
| 0 | 34296 | 7.9% |
| 4 | 30948 | 7.1% |
| 9 | 29109 | 6.7% |
| 6 | 28569 | 6.6% |
| 5 | 27645 | 6.4% |
| 7 | 26568 | 6.1% |
| 8 | 25980 | 6.0% |
year
Text
Missing 
| Distinct | 191 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 453741 |
| Missing (%) | 62.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 11 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1985 |
|---|---|
| 2nd row | 1974 |
| 3rd row | 1980 |
| 4th row | 1963 |
| 5th row | 1956 |
| Value | Count | Frequency (%) |
| 1910 | 7846 | 2.9% |
| 1991 | 7769 | 2.9% |
| 1980 | 7431 | 2.7% |
| 1981 | 7192 | 2.7% |
| 1982 | 7174 | 2.6% |
| 1971 | 6769 | 2.5% |
| 1976 | 6488 | 2.4% |
| 1964 | 5815 | 2.1% |
| 1973 | 5778 | 2.1% |
| 1984 | 5612 | 2.1% |
| Other values (181) | 202893 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 322145 | |
| 9 | 319500 | |
| 8 | 89146 | 8.2% |
| 7 | 77505 | 7.2% |
| 6 | 58473 | 5.4% |
| 0 | 54161 | 5.0% |
| 4 | 44737 | 4.1% |
| 5 | 40639 | 3.8% |
| 2 | 38510 | 3.6% |
| 3 | 38252 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1083068 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 322145 | |
| 9 | 319500 | |
| 8 | 89146 | 8.2% |
| 7 | 77505 | 7.2% |
| 6 | 58473 | 5.4% |
| 0 | 54161 | 5.0% |
| 4 | 44737 | 4.1% |
| 5 | 40639 | 3.8% |
| 2 | 38510 | 3.6% |
| 3 | 38252 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1083068 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 322145 | |
| 9 | 319500 | |
| 8 | 89146 | 8.2% |
| 7 | 77505 | 7.2% |
| 6 | 58473 | 5.4% |
| 0 | 54161 | 5.0% |
| 4 | 44737 | 4.1% |
| 5 | 40639 | 3.8% |
| 2 | 38510 | 3.6% |
| 3 | 38252 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1083068 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 322145 | |
| 9 | 319500 | |
| 8 | 89146 | 8.2% |
| 7 | 77505 | 7.2% |
| 6 | 58473 | 5.4% |
| 0 | 54161 | 5.0% |
| 4 | 44737 | 4.1% |
| 5 | 40639 | 3.8% |
| 2 | 38510 | 3.6% |
| 3 | 38252 | 3.5% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 571556 |
| Missing (%) | 78.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.158729536 |
| Min length | 1 |
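The mean length can be checked against the character total reported for this variable (177230, listed under "Most occurring categories"). This is a small sketch that assumes the mean is the total character count divided by the number of non-missing values. The variable names are illustrative.

```python
# Figures copied from the report for the month variable.
n_observations = 724508
n_missing = 571556
total_chars = 177230      # every character in month is a decimal digit

n_present = n_observations - n_missing
mean_length = total_chars / n_present

# Lengths are only 1 or 2, so the surplus over one character per value
# equals the number of two-character values (months 10 to 12).
two_char_values = total_chars - n_present

print(f"non-missing values:   {n_present}")
print(f"mean length:          {mean_length:.9f}")
print(f"two-character values: {two_char_values}")
```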
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 9 |
| 3rd row | 8 |
| 4th row | 10 |
| 5th row | 4 |
| Value | Count | Frequency (%) |
| 8 | 25708 | |
| 7 | 25619 | |
| 6 | 15211 | |
| 5 | 14666 | |
| 10 | 14523 | |
| 9 | 14275 | |
| 4 | 11358 | |
| 2 | 8535 | 5.6% |
| 3 | 8472 | 5.5% |
| 11 | 6678 | 4.4% |
| Other values (2) | 7907 | 5.2% |
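Several Frequency (%) cells in the table above are blank in this export. They can be recomputed from the counts, as in the following sketch. It assumes that each percentage is relative to the non-missing values, that is, to the sum of all counts in the table. The dictionary name is illustrative.

```python
# Counts copied from the "Value | Count" table for month.
month_counts = {
    "8": 25708, "7": 25619, "6": 15211, "5": 14666, "10": 14523,
    "9": 14275, "4": 11358, "2": 8535, "3": 8472, "11": 6678,
    "Other values (2)": 7907,
}

# Assumption: the denominator is the number of non-missing values,
# which equals the sum of every count in the table.
total = sum(month_counts.values())

month_pct = {value: round(100 * count / total, 1)
             for value, count in month_counts.items()}

for value, pct in month_pct.items():
    print(f"{value:>16} {pct:5.1f}%")
```

The recomputed values agree with the percentages that did survive in the table (5.6%, 5.5%, 4.4% and 5.2%), which supports the assumed denominator.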
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 35786 | |
| 8 | 25708 | |
| 7 | 25619 | |
| 6 | 15211 | |
| 5 | 14666 | |
| 0 | 14523 | |
| 9 | 14275 | 8.1% |
| 2 | 11612 | 6.6% |
| 4 | 11358 | 6.4% |
| 3 | 8472 | 4.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 177230 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 35786 | |
| 8 | 25708 | |
| 7 | 25619 | |
| 6 | 15211 | |
| 5 | 14666 | |
| 0 | 14523 | |
| 9 | 14275 | 8.1% |
| 2 | 11612 | 6.6% |
| 4 | 11358 | 6.4% |
| 3 | 8472 | 4.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 177230 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 35786 | |
| 8 | 25708 | |
| 7 | 25619 | |
| 6 | 15211 | |
| 5 | 14666 | |
| 0 | 14523 | |
| 9 | 14275 | 8.1% |
| 2 | 11612 | 6.6% |
| 4 | 11358 | 6.4% |
| 3 | 8472 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 177230 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 35786 | |
| 8 | 25708 | |
| 7 | 25619 | |
| 6 | 15211 | |
| 5 | 14666 | |
| 0 | 14523 | |
| 9 | 14275 | 8.1% |
| 2 | 11612 | 6.6% |
| 4 | 11358 | 6.4% |
| 3 | 8472 | 4.8% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 593848 |
| Missing (%) | 82.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.719868361 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 23 |
|---|---|
| 2nd row | 24 |
| 3rd row | 18 |
| 4th row | 14 |
| 5th row | 9 |
| Value | Count | Frequency (%) |
| 17 | 5517 | 4.2% |
| 16 | 5029 | 3.8% |
| 18 | 5015 | 3.8% |
| 13 | 4668 | 3.6% |
| 23 | 4653 | 3.6% |
| 14 | 4622 | 3.5% |
| 20 | 4591 | 3.5% |
| 8 | 4550 | 3.5% |
| 15 | 4473 | 3.4% |
| 11 | 4420 | 3.4% |
| Other values (21) | 83122 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 61429 | |
| 2 | 53857 | |
| 3 | 19502 | 8.7% |
| 7 | 13732 | 6.1% |
| 8 | 13721 | 6.1% |
| 6 | 13069 | 5.8% |
| 0 | 12986 | 5.8% |
| 4 | 12423 | 5.5% |
| 9 | 12062 | 5.4% |
| 5 | 11937 | 5.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 224718 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 61429 | |
| 2 | 53857 | |
| 3 | 19502 | 8.7% |
| 7 | 13732 | 6.1% |
| 8 | 13721 | 6.1% |
| 6 | 13069 | 5.8% |
| 0 | 12986 | 5.8% |
| 4 | 12423 | 5.5% |
| 9 | 12062 | 5.4% |
| 5 | 11937 | 5.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 224718 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 61429 | |
| 2 | 53857 | |
| 3 | 19502 | 8.7% |
| 7 | 13732 | 6.1% |
| 8 | 13721 | 6.1% |
| 6 | 13069 | 5.8% |
| 0 | 12986 | 5.8% |
| 4 | 12423 | 5.5% |
| 9 | 12062 | 5.4% |
| 5 | 11937 | 5.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 224718 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 61429 | |
| 2 | 53857 | |
| 3 | 19502 | 8.7% |
| 7 | 13732 | 6.1% |
| 8 | 13721 | 6.1% |
| 6 | 13069 | 5.8% |
| 0 | 12986 | 5.8% |
| 4 | 12423 | 5.5% |
| 9 | 12062 | 5.4% |
| 5 | 11937 | 5.3% |
verbatimEventDate
Text
Missing 
| Distinct | 17805 |
|---|---|
| Distinct (%) | 6.4% |
| Missing | 445814 |
| Missing (%) | 61.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 61 |
|---|---|
| Median length | 11 |
| Mean length | 11.41229808 |
| Min length | 4 |
Unique
| Unique | 5871 |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 23 JAN 1985 |
|---|---|
| 2nd row | April, 1928 |
| 3rd row | -- --- 1980 |
| 4th row | -- --- 1963 |
| 5th row | -- --- 1956 |
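The sample rows mix at least three shapes: a full date (`23 JAN 1985`), a month and year (`April, 1928`), and a year with placeholder dashes (`-- --- 1980`). The sketch below shows one possible way to split such strings into year, month and day. The function name and both patterns are hypothetical and cover only these three shapes. Anything else, such as ranges or seasons, is returned as unparsed.

```python
import re

MONTHS = {name: number for number, name in enumerate(
    ["jan", "feb", "mar", "apr", "may", "jun",
     "jul", "aug", "sep", "oct", "nov", "dec"], start=1)}

# Hypothetical patterns covering only the shapes seen in the sample rows.
DAY_MON_YEAR = re.compile(r"^(\d{1,2}|--) ([A-Za-z]{3}|---) (\d{4})$")
MONTH_YEAR = re.compile(r"^([A-Za-z]+),? (\d{4})$")

def parse_verbatim_date(text):
    """Return (year, month, day), using None for unknown parts.

    Returns None when the string matches neither pattern.
    """
    text = text.strip()
    match = DAY_MON_YEAR.match(text)
    if match:
        day, mon, year = match.groups()
        if mon != "---" and mon.lower() not in MONTHS:
            return None  # three letters that are not a month abbreviation
        month = MONTHS.get(mon.lower())  # '---' yields None
        return int(year), month, (int(day) if day != "--" else None)
    match = MONTH_YEAR.match(text)
    if match:
        month = MONTHS.get(match.group(1)[:3].lower())
        if month is not None:
            return int(match.group(2)), month, None
    return None

for row in ["23 JAN 1985", "April, 1928", "-- --- 1980", "summer 1950"]:
    print(repr(row), "->", parse_verbatim_date(row))
```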
| Value | Count | Frequency (%) |
| (blank) | 235730 | |
| aug | 23677 | 2.9% |
| jul | 22916 | 2.8% |
| summer | 20031 | 2.5% |
| jun | 14619 | 1.8% |
| may | 14325 | 1.8% |
| oct | 14287 | 1.7% |
| to | 13955 | 1.7% |
| sep | 13176 | 1.6% |
| apr | 10764 | 1.3% |
| Other values (1210) | 433163 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 633590 | |
| (space) | 537949 | |
| 1 | 382844 | |
| 9 | 314473 | |
| 8 | 105770 | 3.3% |
| 0 | 101858 | 3.2% |
| 7 | 96225 | 3.0% |
| 2 | 94879 | 3.0% |
| 6 | 69663 | 2.2% |
| A | 63864 | 2.0% |
| Other values (59) | 779424 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1340357 | |
| Dash Punctuation | 633590 | |
| Space Separator | 537949 | |
| Uppercase Letter | 491521 | 15.5% |
| Lowercase Letter | 169648 | 5.3% |
| Other Punctuation | 6422 | 0.2% |
| Math Symbol | 1026 | < 0.1% |
| Open Punctuation | 13 | < 0.1% |
| Close Punctuation | 13 | < 0.1% |
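The category names in this table match Unicode general categories. As a rough illustration, the following sketch counts characters per category with Python's standard `unicodedata` module. It assumes the report's labels correspond to the two-letter category codes as mapped below. The function name and the mapping dictionary are illustrative, and the input is only the first three sample rows, not the full column.

```python
import unicodedata
from collections import Counter

# Unicode general-category codes mapped to the labels used in the report.
# Only the categories that appear in the table above are listed.
CATEGORY_LABELS = {
    "Nd": "Decimal Number",
    "Pd": "Dash Punctuation",
    "Zs": "Space Separator",
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Po": "Other Punctuation",
    "Sm": "Math Symbol",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def count_categories(values):
    """Count characters per Unicode general category across all strings."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts

# The first three sample rows shown for verbatimEventDate.
sample = ["23 JAN 1985", "April, 1928", "-- --- 1980"]
category_counts = count_categories(sample)
for label, n in category_counts.most_common():
    print(label, n)
```

In Unicode the vertical bar `|` (U+007C) belongs to category `Sm`, which is why it is listed under Math Symbol for this variable.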
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| m | 40530 | |
| u | 32141 | |
| e | 26707 | |
| r | 24584 | |
| t | 7049 | 4.2% |
| a | 5225 | 3.1% |
| l | 4565 | 2.7% |
| g | 3709 | 2.2% |
| n | 3604 | 2.1% |
| p | 3590 | 2.1% |
| Other values (13) | 17944 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 63864 | |
| U | 61193 | |
| J | 48266 | 9.8% |
| O | 36480 | 7.4% |
| S | 35414 | 7.2% |
| T | 28143 | 5.7% |
| N | 24509 | 5.0% |
| P | 23974 | 4.9% |
| E | 23721 | 4.8% |
| G | 23661 | 4.8% |
| Other values (11) | 122296 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 382844 | |
| 9 | 314473 | |
| 8 | 105770 | 7.9% |
| 0 | 101858 | 7.6% |
| 7 | 96225 | 7.2% |
| 2 | 94879 | 7.1% |
| 6 | 69663 | 5.2% |
| 3 | 60386 | 4.5% |
| 4 | 58552 | 4.4% |
| 5 | 55707 | 4.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3733 | |
| . | 1309 | 20.4% |
| ' | 650 | 10.1% |
| / | 634 | 9.9% |
| ? | 92 | 1.4% |
| ; | 2 | < 0.1% |
| & | 1 | < 0.1% |
| * | 1 | < 0.1% |
Math Symbol
| Value | Count | Frequency (%) |
| \| | 1017 | |
| + | 5 | 0.5% |
| ~ | 4 | 0.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 633590 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 537949 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 13 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2519370 | |
| Latin | 661169 | 20.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 63864 | 9.7% |
| U | 61193 | 9.3% |
| J | 48266 | 7.3% |
| m | 40530 | 6.1% |
| O | 36480 | 5.5% |
| S | 35414 | 5.4% |
| u | 32141 | 4.9% |
| T | 28143 | 4.3% |
| e | 26707 | 4.0% |
| r | 24584 | 3.7% |
| Other values (34) | 263847 |
Common
| Value | Count | Frequency (%) |
| - | 633590 | |
| (space) | 537949 | |
| 1 | 382844 | |
| 9 | 314473 | |
| 8 | 105770 | 4.2% |
| 0 | 101858 | 4.0% |
| 7 | 96225 | 3.8% |
| 2 | 94879 | 3.8% |
| 6 | 69663 | 2.8% |
| 3 | 60386 | 2.4% |
| Other values (15) | 121733 | 4.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3180539 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 633590 | |
| (space) | 537949 | |
| 1 | 382844 | |
| 9 | 314473 | |
| 8 | 105770 | 3.3% |
| 0 | 101858 | 3.2% |
| 7 | 96225 | 3.0% |
| 2 | 94879 | 3.0% |
| 6 | 69663 | 2.2% |
| A | 63864 | 2.0% |
| Other values (59) | 779424 |
locationID
Text
Missing 
| Distinct | 66560 |
|---|---|
| Distinct (%) | 17.1% |
| Missing | 335037 |
| Missing (%) | 46.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 61 |
|---|---|
| Median length | 59 |
| Mean length | 5.757204002 |
| Min length | 1 |
Unique
| Unique | 40451 |
|---|---|
| Unique (%) | 10.4% |
Sample
| 1st row | 1612 |
|---|---|
| 2nd row | 06 |
| 3rd row | USGS LOC M533 |
| 4th row | 42246 |
| 5th row | 707A |
| Value | Count | Frequency (%) |
| 42246 | 30863 | 6.4% |
| 35k | 30551 | 6.3% |
| loc | 19929 | 4.1% |
| sta | 7656 | 1.6% |
| d | 5640 | 1.2% |
| site | 4020 | 0.8% |
| 40193 | 3269 | 0.7% |
| leg | 3132 | 0.7% |
| olson | 2904 | 0.6% |
| 41142 | 2897 | 0.6% |
| Other values (59519) | 370823 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 252324 | 11.3% |
| 1 | 209625 | 9.3% |
| 4 | 194523 | 8.7% |
| 3 | 152357 | 6.8% |
| 0 | 140257 | 6.3% |
| 5 | 136706 | 6.1% |
| 6 | 130433 | 5.8% |
| 7 | 107242 | 4.8% |
| 8 | 99787 | 4.5% |
| 9 | 93127 | 4.2% |
| Other values (71) | 725883 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1516381 | |
| Uppercase Letter | 531863 | 23.7% |
| Space Separator | 92213 | 4.1% |
| Dash Punctuation | 52032 | 2.3% |
| Other Punctuation | 28932 | 1.3% |
| Lowercase Letter | 15132 | 0.7% |
| Math Symbol | 3062 | 0.1% |
| Close Punctuation | 1336 | 0.1% |
| Open Punctuation | 1313 | 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 51448 | 9.7% |
| L | 50984 | 9.6% |
| C | 46019 | 8.7% |
| S | 44241 | 8.3% |
| A | 41228 | 7.8% |
| E | 37168 | 7.0% |
| K | 36506 | 6.9% |
| T | 30011 | 5.6% |
| I | 25951 | 4.9% |
| N | 20969 | 3.9% |
| Other values (16) | 147338 |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2360 | |
| a | 1816 | |
| g | 1802 | |
| t | 1447 | |
| o | 1201 | |
| c | 1136 | |
| i | 1026 | |
| s | 789 | 5.2% |
| b | 707 | 4.7% |
| n | 562 | 3.7% |
| Other values (16) | 2286 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13863 | |
| , | 10529 | |
| * | 2055 | 7.1% |
| / | 1776 | 6.1% |
| ' | 442 | 1.5% |
| # | 178 | 0.6% |
| ; | 41 | 0.1% |
| ? | 34 | 0.1% |
| : | 7 | < 0.1% |
| " | 6 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 252324 | |
| 1 | 209625 | |
| 4 | 194523 | |
| 3 | 152357 | |
| 0 | 140257 | |
| 5 | 136706 | |
| 6 | 130433 | |
| 7 | 107242 | |
| 8 | 99787 | 6.6% |
| 9 | 93127 | 6.1% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 3039 | |
| = | 23 | 0.8% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1335 | |
| ] | 1 | 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1304 | |
| [ | 9 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 92213 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 52032 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1695269 | |
| Latin | 546995 | 24.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 51448 | 9.4% |
| L | 50984 | 9.3% |
| C | 46019 | 8.4% |
| S | 44241 | 8.1% |
| A | 41228 | 7.5% |
| E | 37168 | 6.8% |
| K | 36506 | 6.7% |
| T | 30011 | 5.5% |
| I | 25951 | 4.7% |
| N | 20969 | 3.8% |
| Other values (42) | 162470 |
Common
| Value | Count | Frequency (%) |
| 2 | 252324 | |
| 1 | 209625 | |
| 4 | 194523 | |
| 3 | 152357 | |
| 0 | 140257 | |
| 5 | 136706 | |
| 6 | 130433 | |
| 7 | 107242 | |
| 8 | 99787 | 5.9% |
| 9 | 93127 | 5.5% |
| Other values (19) | 178888 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2242264 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 252324 | 11.3% |
| 1 | 209625 | 9.3% |
| 4 | 194523 | 8.7% |
| 3 | 152357 | 6.8% |
| 0 | 140257 | 6.3% |
| 5 | 136706 | 6.1% |
| 6 | 130433 | 5.8% |
| 7 | 107242 | 4.8% |
| 8 | 99787 | 4.5% |
| 9 | 93127 | 4.2% |
| Other values (71) | 725883 |
higherGeography
Text
Missing 
| Distinct | 4708 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 148417 |
| Missing (%) | 20.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 111 |
|---|---|
| Median length | 97 |
| Mean length | 42.17362361 |
| Min length | 4 |
Unique
| Unique | 1213 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North America, United States, Florida |
|---|---|
| 2nd row | Africa, Kenya, Marsabit |
| 3rd row | North America, United States, Nevada, Pershing County |
| 4th row | Cuba, Camaguey Prov |
| 5th row | North America, United States, North Carolina, Beaufort County |
| Value | Count | Frequency (%) |
| north | 537307 | |
| america | 480121 | |
| united | 421781 | |
| states | 421705 | |
| county | 259124 | 7.9% |
| carolina | 46843 | 1.4% |
| canada | 38942 | 1.2% |
| texas | 38273 | 1.2% |
| colorado | 35917 | 1.1% |
| beaufort | 33680 | 1.0% |
| Other values (2951) | 959718 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2697320 | 11.1% |
| t | 2343978 | 9.6% |
| a | 2051368 | 8.4% |
| e | 1823223 | 7.5% |
| i | 1571709 | 6.5% |
| r | 1497295 | 6.2% |
| o | 1387848 | 5.7% |
| , | 1279367 | 5.3% |
| n | 1260166 | 5.2% |
| s | 766919 | 3.2% |
| Other values (58) | 7616652 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17040948 | |
| Uppercase Letter | 3272221 | 13.5% |
| Space Separator | 2697320 | 11.1% |
| Other Punctuation | 1284183 | 5.3% |
| Dash Punctuation | 1169 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2343978 | |
| a | 2051368 | |
| e | 1823223 | |
| i | 1571709 | |
| r | 1497295 | |
| o | 1387848 | |
| n | 1260166 | |
| s | 766919 | 4.5% |
| h | 662498 | 3.9% |
| c | 650930 | 3.8% |
| Other values (24) | 3025014 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 590551 | |
| A | 571156 | |
| C | 498307 | |
| S | 484309 | |
| U | 430602 | |
| B | 108340 | 3.3% |
| M | 87750 | 2.7% |
| O | 60025 | 1.8% |
| T | 59534 | 1.8% |
| P | 52139 | 1.6% |
| Other values (16) | 329508 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1279367 | |
| . | 3038 | 0.2% |
| ' | 1757 | 0.1% |
| ? | 21 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2697320 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1169 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20313169 | |
| Common | 3982676 | 16.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2343978 | 11.5% |
| a | 2051368 | 10.1% |
| e | 1823223 | 9.0% |
| i | 1571709 | 7.7% |
| r | 1497295 | 7.4% |
| o | 1387848 | 6.8% |
| n | 1260166 | 6.2% |
| s | 766919 | 3.8% |
| h | 662498 | 3.3% |
| c | 650930 | 3.2% |
| Other values (50) | 6297235 |
Common
| Value | Count | Frequency (%) |
| (space) | 2697320 | |
| , | 1279367 | |
| . | 3038 | 0.1% |
| ' | 1757 | < 0.1% |
| - | 1169 | < 0.1% |
| ? | 21 | < 0.1% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24288672 | |
| None | 7173 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2697320 | 11.1% |
| t | 2343978 | 9.7% |
| a | 2051368 | 8.4% |
| e | 1823223 | 7.5% |
| i | 1571709 | 6.5% |
| r | 1497295 | 6.2% |
| o | 1387848 | 5.7% |
| , | 1279367 | 5.3% |
| n | 1260166 | 5.2% |
| s | 766919 | 3.2% |
| Other values (50) | 7609479 |
None
| Value | Count | Frequency (%) |
| ó | 3473 | |
| í | 2116 | |
| á | 1037 | 14.5% |
| é | 539 | 7.5% |
| ñ | 4 | 0.1% |
| è | 2 | < 0.1% |
| ä | 1 | < 0.1% |
| ú | 1 | < 0.1% |
continent
Text
Missing 
| Distinct | 44 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 210428 |
| Missing (%) | 29.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 13 |
| Mean length | 13.19896709 |
| Min length | 4 |
Unique
| Unique | 6 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | North America |
|---|---|
| 2nd row | Africa |
| 3rd row | North America |
| 4th row | North America |
| 5th row | North America |
| Value | Count | Frequency (%) |
| north | 491990 | |
| america | 480118 | |
| ocean | 26667 | 2.6% |
| atlantic | 13621 | 1.3% |
| south | 9893 | 0.9% |
| pacific | 8356 | 0.8% |
| indian | 4034 | 0.4% |
| africa | 3468 | 0.3% |
| oceania | 2870 | 0.3% |
| europe | 1626 | 0.2% |
| Other values (7) | 1509 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| r | 977899 | |
| c | 544584 | |
| a | 542896 | |
| (space) | 530072 | |
| t | 529855 | |
| i | 522205 | |
| e | 511408 | |
| o | 503636 | |
| h | 502009 | |
| A | 498588 | |
| Other values (16) | 1122173 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5209576 | |
| Uppercase Letter | 1044152 | 15.4% |
| Space Separator | 530072 | 7.8% |
| Other Punctuation | 1525 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 977899 | |
| c | 544584 | |
| a | 542896 | |
| t | 529855 | |
| i | 522205 | |
| e | 511408 | |
| o | 503636 | |
| h | 502009 | |
| m | 480119 | |
| n | 51386 | 1.0% |
| Other values (6) | 43579 | 0.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 498588 | |
| N | 491990 | |
| O | 29537 | 2.8% |
| S | 10020 | 1.0% |
| P | 8356 | 0.8% |
| I | 4034 | 0.4% |
| E | 1626 | 0.2% |
| T | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 530072 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1525 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6253728 | |
| Common | 531597 | 7.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 977899 | |
| c | 544584 | |
| a | 542896 | |
| t | 529855 | |
| i | 522205 | |
| e | 511408 | |
| o | 503636 | |
| h | 502009 | |
| A | 498588 | |
| N | 491990 | |
| Other values (14) | 628658 |
Common
| Value | Count | Frequency (%) |
| (space) | 530072 | |
| , | 1525 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6785325 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| r | 977899 | |
| c | 544584 | |
| a | 542896 | |
| (space) | 530072 | |
| t | 529855 | |
| i | 522205 | |
| e | 511408 | |
| o | 503636 | |
| h | 502009 | |
| A | 498588 | |
| Other values (16) | 1122173 |
waterBody
Text
Missing 
| Distinct | 172 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 696851 |
| Missing (%) | 96.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 61 |
|---|---|
| Median length | 54 |
| Mean length | 21.95758759 |
| Min length | 8 |
Unique
| Unique | 58 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | North Atlantic Ocean |
|---|---|
| 2nd row | North Pacific Ocean |
| 3rd row | North Atlantic Ocean, Caribbean Sea |
| 4th row | North Atlantic Ocean |
| 5th row | North Atlantic Ocean |
| Value | Count | Frequency (%) |
| ocean | 26667 | |
| north | 18835 | |
| atlantic | 13621 | |
| pacific | 8356 | 8.8% |
| sea | 5778 | 6.1% |
| indian | 4034 | 4.3% |
| south | 2993 | 3.2% |
| timor | 2479 | 2.6% |
| of | 2181 | 2.3% |
| gulf | 2067 | 2.2% |
| Other values (146) | 7758 | 8.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 67112 | |
| a | 66029 | |
| c | 60399 | |
| n | 52729 | 8.7% |
| t | 51240 | 8.4% |
| i | 42959 | 7.1% |
| e | 39252 | 6.5% |
| o | 28732 | 4.7% |
| O | 27050 | 4.5% |
| r | 26329 | 4.3% |
| Other values (39) | 145450 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 439588 | |
| Uppercase Letter | 92948 | 15.3% |
| Space Separator | 67112 | 11.1% |
| Other Punctuation | 7633 | 1.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 66029 | |
| c | 60399 | |
| n | 52729 | |
| t | 51240 | |
| i | 42959 | |
| e | 39252 | |
| o | 28732 | |
| r | 26329 | 6.0% |
| h | 22202 | 5.1% |
| l | 16619 | 3.8% |
| Other values (15) | 33098 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 27050 | |
| N | 18947 | |
| A | 14632 | |
| S | 9530 | 10.3% |
| P | 8558 | 9.2% |
| I | 4100 | 4.4% |
| M | 2579 | 2.8% |
| T | 2567 | 2.8% |
| G | 2317 | 2.5% |
| C | 1788 | 1.9% |
| Other values (12) | 880 | 0.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 67112 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 7633 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 532536 | |
| Common | 74745 | 12.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 66029 | |
| c | 60399 | |
| n | 52729 | |
| t | 51240 | |
| i | 42959 | 8.1% |
| e | 39252 | 7.4% |
| o | 28732 | 5.4% |
| O | 27050 | 5.1% |
| r | 26329 | 4.9% |
| h | 22202 | 4.2% |
| Other values (37) | 115615 |
Common
| Value | Count | Frequency (%) |
| (space) | 67112 | |
| , | 7633 | 10.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 607281 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 67112 | |
| a | 66029 | |
| c | 60399 | |
| n | 52729 | 8.7% |
| t | 51240 | 8.4% |
| i | 42959 | 7.1% |
| e | 39252 | 6.5% |
| o | 28732 | 4.7% |
| O | 27050 | 4.5% |
| r | 26329 | 4.3% |
| Other values (39) | 145450 |
islandGroup
Text
Missing 
| Distinct | 33 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 723710 |
| Missing (%) | 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 24 |
| Mean length | 16.78571429 |
| Min length | 5 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | Mariana Islands |
|---|---|
| 2nd row | Northern Mariana Islands |
| 3rd row | Gilbert Islands |
| 4th row | Gilbert Islands |
| 5th row | Aleutian Islands |
| Value | Count | Frequency (%) |
| islands | 765 | |
| marshall | 241 | 14.0% |
| mariana | 155 | 9.0% |
| gilbert | 135 | 7.9% |
| northern | 134 | 7.8% |
| marianas | 120 | 7.0% |
| solomon | 21 | 1.2% |
| ryukyu | 18 | 1.0% |
| hawaiian | 18 | 1.0% |
| antilles | 15 | 0.9% |
| Other values (26) | 97 | 5.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2202 | |
| s | 1936 | |
| l | 1461 | |
| n | 1270 | |
| r | 960 | |
| (space) | 921 | |
| d | 800 | 6.0% |
| I | 765 | 5.7% |
| M | 527 | 3.9% |
| i | 498 | 3.7% |
| Other values (36) | 2055 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10752 | |
| Uppercase Letter | 1720 | 12.8% |
| Space Separator | 921 | 6.9% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2202 | |
| s | 1936 | |
| l | 1461 | |
| n | 1270 | |
| r | 960 | |
| d | 800 | 7.4% |
| i | 498 | 4.6% |
| h | 376 | 3.5% |
| e | 374 | 3.5% |
| t | 298 | 2.8% |
| Other values (13) | 577 | 5.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 765 | |
| M | 527 | |
| N | 140 | 8.1% |
| G | 135 | 7.8% |
| A | 25 | 1.5% |
| L | 24 | 1.4% |
| S | 24 | 1.4% |
| H | 18 | 1.0% |
| R | 18 | 1.0% |
| C | 11 | 0.6% |
| Other values (11) | 33 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 921 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12472 | |
| Common | 923 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2202 | |
| s | 1936 | |
| l | 1461 | |
| n | 1270 | |
| r | 960 | |
| d | 800 | 6.4% |
| I | 765 | 6.1% |
| M | 527 | 4.2% |
| i | 498 | 4.0% |
| h | 376 | 3.0% |
| Other values (34) | 1677 |
Common
| Value | Count | Frequency (%) |
| (space) | 921 | |
| . | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13395 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2202 | |
| s | 1936 | |
| l | 1461 | |
| n | 1270 | |
| r | 960 | |
| (space) | 921 | |
| d | 800 | 6.0% |
| I | 765 | 5.7% |
| M | 527 | 3.9% |
| i | 498 | 3.7% |
| Other values (36) | 2055 |
island
Text
Missing 
| Distinct | 87 |
|---|---|
| Distinct (%) | 0.9% |
| Missing | 714401 |
| Missing (%) | 98.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 4 |
| Mean length | 6.015335906 |
| Min length | 3 |
Unique
| Unique | 38 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Oahu |
|---|---|
| 2nd row | Oahu |
| 3rd row | Oahu |
| 4th row | Animasola Island |
| 5th row | Molokai |
| Value | Count | Frequency (%) |
| oahu | 5926 | |
| molokai | 2218 | 19.1% |
| saint | 944 | 8.1% |
| helena | 938 | 8.1% |
| atoll | 241 | 2.1% |
| saipan | 132 | 1.1% |
| guam | 129 | 1.1% |
| onotoa | 116 | 1.0% |
| martha's | 108 | 0.9% |
| vineyard | 108 | 0.9% |
| Other values (91) | 728 | 6.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 11360 | |
| u | 6232 | |
| h | 6099 | |
| O | 6043 | |
| o | 5165 | |
| i | 4062 | 6.7% |
| l | 3813 | 6.3% |
| n | 2689 | 4.4% |
| k | 2476 | 4.1% |
| M | 2342 | 3.9% |
| Other values (40) | 10516 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47612 | |
| Uppercase Letter | 11591 | 19.1% |
| Space Separator | 1481 | 2.4% |
| Other Punctuation | 109 | 0.2% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 11360 | |
| u | 6232 | |
| h | 6099 | |
| o | 5165 | |
| i | 4062 | 8.5% |
| l | 3813 | 8.0% |
| n | 2689 | 5.6% |
| k | 2476 | 5.2% |
| e | 2309 | 4.8% |
| t | 1709 | 3.6% |
| Other values (16) | 1698 | 3.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 6043 | |
| M | 2342 | 20.2% |
| S | 1177 | 10.2% |
| H | 941 | 8.1% |
| A | 273 | 2.4% |
| G | 140 | 1.2% |
| B | 138 | 1.2% |
| E | 125 | 1.1% |
| V | 121 | 1.0% |
| I | 89 | 0.8% |
| Other values (11) | 202 | 1.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1481 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 109 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 59203 | |
| Common | 1594 | 2.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 11360 | |
| u | 6232 | |
| h | 6099 | |
| O | 6043 | |
| o | 5165 | |
| i | 4062 | 6.9% |
| l | 3813 | 6.4% |
| n | 2689 | 4.5% |
| k | 2476 | 4.2% |
| M | 2342 | 4.0% |
| Other values (37) | 8922 |
Common
| Value | Count | Frequency (%) |
| (space) | 1481 | |
| ' | 109 | 6.8% |
| - | 4 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 60794 | |
| None | 3 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 11360 | |
| u | 6232 | |
| h | 6099 | |
| O | 6043 | |
| o | 5165 | |
| i | 4062 | 6.7% |
| l | 3813 | 6.3% |
| n | 2689 | 4.4% |
| k | 2476 | 4.1% |
| M | 2342 | 3.9% |
| Other values (38) | 10513 |
None
| Value | Count | Frequency (%) |
| ñ | 2 | |
| é | 1 |
country
Text
Missing 
| Distinct | 227 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 173269 |
| Missing (%) | 23.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 44 |
|---|---|
| Median length | 13 |
| Mean length | 11.8822108 |
| Min length | 4 |
Unique
| Unique | 39 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | United States |
|---|---|
| 2nd row | Kenya |
| 3rd row | United States |
| 4th row | Cuba |
| 5th row | United States |
| Value | Count | Frequency (%) |
| united | 421781 | |
| states | 421705 | |
| canada | 38942 | 3.9% |
| panama | 8607 | 0.9% |
| republic | 6480 | 0.6% |
| dominican | 6290 | 0.6% |
| islands | 4307 | 0.4% |
| mexico | 3812 | 0.4% |
| colombia | 3579 | 0.4% |
| france | 3529 | 0.4% |
| Other values (228) | 84524 | 8.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 1291649 | |
| e | 891107 | |
| a | 672519 | |
| n | 536738 | |
| i | 496752 | 7.6% |
| d | 485872 | 7.4% |
| s | 453446 | 6.9% |
| (space) | 452317 | 6.9% |
| S | 427898 | 6.5% |
| U | 422899 | 6.5% |
| Other values (47) | 418741 | 6.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5095180 | |
| Uppercase Letter | 1001497 | 15.3% |
| Space Separator | 452317 | 6.9% |
| Other Punctuation | 942 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1291649 | |
| e | 891107 | |
| a | 672519 | |
| n | 536738 | |
| i | 496752 | 9.7% |
| d | 485872 | 9.5% |
| s | 453446 | 8.9% |
| c | 41278 | 0.8% |
| l | 37380 | 0.7% |
| o | 35955 | 0.7% |
| Other values (17) | 152484 | 3.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 427898 | |
| U | 422899 | |
| C | 51338 | 5.1% |
| P | 16560 | 1.7% |
| R | 12128 | 1.2% |
| I | 10645 | 1.1% |
| A | 10000 | 1.0% |
| M | 6468 | 0.6% |
| D | 6444 | 0.6% |
| B | 5765 | 0.6% |
| Other values (15) | 31352 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 940 | |
| . | 2 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 452317 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6096677 | |
| Common | 453261 | 6.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 1291649 | |
| e | 891107 | |
| a | 672519 | |
| n | 536738 | |
| i | 496752 | 8.1% |
| d | 485872 | 8.0% |
| s | 453446 | 7.4% |
| S | 427898 | 7.0% |
| U | 422899 | 6.9% |
| C | 51338 | 0.8% |
| Other values (42) | 366459 | 6.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 452317 | |
| , | 940 | 0.2% |
| . | 2 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6549937 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 1291649 | |
| e | 891107 | |
| a | 672519 | |
| n | 536738 | |
| i | 496752 | 7.6% |
| d | 485872 | 7.4% |
| s | 453446 | 6.9% |
| (space) | 452317 | 6.9% |
| S | 427898 | 6.5% |
| U | 422899 | 6.5% |
| Other values (46) | 418740 | 6.4% |
None
| Value | Count | Frequency (%) |
| é | 1 |
stateProvince
Text
Missing 
| Distinct | 892 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 226462 |
| Missing (%) | 31.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 23 |
| Mean length | 8.789222281 |
| Min length | 3 |
Unique
| Unique | 236 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Marsabit |
| 3rd row | Nevada |
| 4th row | Camaguey Prov |
| 5th row | North Carolina |
| Value | Count | Frequency (%) |
| carolina | 46813 | 7.5% |
| north | 45129 | 7.2% |
| texas | 38253 | 6.1% |
| colorado | 35917 | 5.8% |
| california | 32474 | 5.2% |
| columbia | 32203 | 5.2% |
| british | 32085 | 5.1% |
| alaska | 28545 | 4.6% |
| new | 23155 | 3.7% |
| wyoming | 22778 | 3.6% |
| Other values (878) | 287106 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 622536 | |
| i | 445132 | 10.2% |
| o | 412678 | 9.4% |
| r | 299951 | 6.9% |
| n | 262321 | 6.0% |
| l | 249350 | 5.7% |
| s | 213346 | 4.9% |
| e | 190372 | 4.3% |
| C | 155417 | 3.6% |
| t | 143584 | 3.3% |
| Other values (54) | 1382750 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3624857 | |
| Uppercase Letter | 625183 | 14.3% |
| Space Separator | 126412 | 2.9% |
| Dash Punctuation | 508 | < 0.1% |
| Other Punctuation | 475 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
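The category labels in the table above correspond to Unicode general categories (Lowercase Letter is `Ll`, Space Separator is `Zs`, and so on). As a minimal sketch, assuming the report tallies one general category per character, the counts for a handful of values could be reproduced with Python's standard `unicodedata` module. The helper name `category_counts` and the `LABELS` mapping are illustrative and not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Illustrative mapping from Unicode general-category codes to the labels
# used in the report; only the categories seen in this column are listed.
LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Pd": "Dash Punctuation",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

def category_counts(values):
    """Count characters per Unicode general category across all values."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not in LABELS.
            counts[LABELS.get(code, code)] += 1
    return counts

# The five sample rows shown for stateProvince.
sample = ["Florida", "Marsabit", "Nevada", "Camaguey Prov", "North Carolina"]
print(category_counts(sample))
```

Running the same tally over the full column should, under that assumption, give totals comparable to the table.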
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 622536 | |
| i | 445132 | |
| o | 412678 | |
| r | 299951 | |
| n | 262321 | 7.2% |
| l | 249350 | 6.9% |
| s | 213346 | 5.9% |
| e | 190372 | 5.3% |
| t | 143584 | 4.0% |
| h | 114639 | 3.2% |
| Other values (22) | 670948 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 155417 | |
| N | 87902 | |
| M | 48444 | 7.7% |
| T | 47635 | 7.6% |
| A | 45155 | 7.2% |
| B | 36744 | 5.9% |
| W | 32086 | 5.1% |
| H | 20814 | 3.3% |
| O | 19325 | 3.1% |
| I | 17859 | 2.9% |
| Other values (16) | 113802 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 425 | |
| ' | 50 | 10.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 126412 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 508 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4250040 | |
| Common | 127397 | 2.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 622536 | |
| i | 445132 | 10.5% |
| o | 412678 | 9.7% |
| r | 299951 | 7.1% |
| n | 262321 | 6.2% |
| l | 249350 | 5.9% |
| s | 213346 | 5.0% |
| e | 190372 | 4.5% |
| C | 155417 | 3.7% |
| t | 143584 | 3.4% |
| Other values (48) | 1255353 |
Common
| Value | Count | Frequency (%) |
| (space) | 126412 | |
| - | 508 | 0.4% |
| . | 425 | 0.3% |
| ' | 50 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4371514 | |
| None | 5923 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 622536 | |
| i | 445132 | 10.2% |
| o | 412678 | 9.4% |
| r | 299951 | 6.9% |
| n | 262321 | 6.0% |
| l | 249350 | 5.7% |
| s | 213346 | 4.9% |
| e | 190372 | 4.4% |
| C | 155417 | 3.6% |
| t | 143584 | 3.3% |
| Other values (48) | 1376827 |
None
| Value | Count | Frequency (%) |
| ó | 2622 | |
| í | 1945 | |
| á | 1034 | 17.5% |
| é | 319 | 5.4% |
| è | 2 | < 0.1% |
| ñ | 1 | < 0.1% |
county
Text
Missing 
| Distinct | 1997 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 454433 |
| Missing (%) | 62.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 29 |
| Mean length | 14.2528779 |
| Min length | 3 |
Unique
| Unique | 393 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Pershing County |
|---|---|
| 2nd row | Beaufort County |
| 3rd row | Brewster County |
| 4th row | Los Angeles County |
| 5th row | Honolulu County |
| Value | Count | Frequency (%) |
| county | 259124 | |
| beaufort | 33592 | 5.9% |
| brewster | 15677 | 2.8% |
| maui | 10401 | 1.8% |
| los | 8883 | 1.6% |
| angeles | 8865 | 1.6% |
| honolulu | 5926 | 1.0% |
| san | 4953 | 0.9% |
| lincoln | 4346 | 0.8% |
| culberson | 4132 | 0.7% |
| Other values (1945) | 212334 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| (space) | 298158 | 7.7% |
| C | 289740 | 7.5% |
| y | 279783 | 7.3% |
| e | 215178 | 5.6% |
| a | 186491 | 4.8% |
| r | 177010 | 4.6% |
| Other values (55) | 850179 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2976107 | |
| Uppercase Letter | 570194 | 14.8% |
| Space Separator | 298158 | 7.7% |
| Other Punctuation | 4230 | 0.1% |
| Dash Punctuation | 657 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| y | 279783 | |
| e | 215178 | |
| a | 186491 | |
| r | 177010 | |
| l | 100058 | 3.4% |
| s | 96459 | 3.2% |
| Other values (23) | 368321 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 289740 | |
| B | 65415 | 11.5% |
| M | 27388 | 4.8% |
| S | 25040 | 4.4% |
| L | 22655 | 4.0% |
| P | 16991 | 3.0% |
| A | 16627 | 2.9% |
| H | 14879 | 2.6% |
| D | 12691 | 2.2% |
| W | 9829 | 1.7% |
| Other values (16) | 68939 | 12.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2609 | |
| ' | 1598 | |
| ? | 21 | 0.5% |
| , | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 298158 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 657 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3546301 | |
| Common | 303045 | 7.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| C | 289740 | 8.2% |
| y | 279783 | 7.9% |
| e | 215178 | 6.1% |
| a | 186491 | 5.3% |
| r | 177010 | 5.0% |
| l | 100058 | 2.8% |
| Other values (49) | 745234 |
Common
| Value | Count | Frequency (%) |
| (space) | 298158 | |
| . | 2609 | 0.9% |
| ' | 1598 | 0.5% |
| - | 657 | 0.2% |
| ? | 21 | < 0.1% |
| , | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3848100 | |
| None | 1246 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 423340 | |
| n | 401510 | |
| t | 375302 | |
| u | 352655 | |
| (space) | 298158 | 7.7% |
| C | 289740 | 7.5% |
| y | 279783 | 7.3% |
| e | 215178 | 5.6% |
| a | 186491 | 4.8% |
| r | 177010 | 4.6% |
| Other values (48) | 848933 |
None
| Value | Count | Frequency (%) |
| ó | 851 | |
| é | 218 | 17.5% |
| í | 171 | 13.7% |
| á | 3 | 0.2% |
| ä | 1 | 0.1% |
| ñ | 1 | 0.1% |
| ú | 1 | 0.1% |
locality
Text
Missing 
| Distinct | 31755 |
|---|---|
| Distinct (%) | 19.4% |
| Missing | 560871 |
| Missing (%) | 77.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 471 |
|---|---|
| Median length | 316 |
| Mean length | 59.79365302 |
| Min length | 1 |
Unique
| Unique | 21088 |
|---|---|
| Unique (%) | 12.9% |
Sample
| 1st row | St. Andrew Bay |
|---|---|
| 2nd row | Nuevitas Bay, Between Nuevitas And Pastelillo |
| 3rd row | Palos Verdes Hills; East side of Deadman's Island |
| 4th row | North slope of San Pedro Hills, ravine S of harbor City, 4200 feet N and 53.5 degrees E from 342-foot hill, 100 feet up ravine from end of Bellepoint Street (W98-30) |
| 5th row | Coyote Springs Valley; spring |
| Value | Count | Frequency (%) |
| of | 120156 | 7.0% |
| 34919 | 2.0% | |
| and | 22265 | 1.3% |
| bay | 19665 | 1.1% |
| the | 18421 | 1.1% |
| on | 17778 | 1.0% |
| from | 16823 | 1.0% |
| n | 16777 | 1.0% |
| feet | 15757 | 0.9% |
| river | 15334 | 0.9% |
| Other values (34131) | 1421831 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 1556089 | 15.9% |
| e | 696361 | 7.1% |
| a | 667574 | 6.8% |
| o | 563183 | 5.8% |
| n | 459218 | 4.7% |
| t | 454511 | 4.6% |
| r | 411334 | 4.2% |
| i | 400897 | 4.1% |
| l | 325764 | 3.3% |
| s | 321111 | 3.3% |
| Other values (90) | 3928412 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5944612 | |
| Space Separator | 1556089 | 15.9% |
| Uppercase Letter | 1178423 | 12.0% |
| Decimal Number | 550644 | 5.6% |
| Other Punctuation | 394583 | 4.0% |
| Dash Punctuation | 53241 | 0.5% |
| Open Punctuation | 40436 | 0.4% |
| Close Punctuation | 40130 | 0.4% |
| Math Symbol | 26252 | 0.3% |
| Connector Punctuation | 35 | < 0.1% |
| Other values (2) | 9 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 696361 | |
| a | 667574 | |
| o | 563183 | 9.5% |
| n | 459218 | 7.7% |
| t | 454511 | 7.6% |
| r | 411334 | 6.9% |
| i | 400897 | 6.7% |
| l | 325764 | 5.5% |
| s | 321111 | 5.4% |
| f | 214145 | 3.6% |
| Other values (21) | 1430514 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 174349 | |
| C | 112608 | 9.6% |
| O | 84502 | 7.2% |
| N | 76103 | 6.5% |
| B | 74870 | 6.4% |
| R | 70202 | 6.0% |
| P | 66766 | 5.7% |
| A | 62224 | 5.3% |
| W | 51082 | 4.3% |
| T | 49542 | 4.2% |
| Other values (17) | 356175 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 179506 | |
| . | 103955 | |
| ; | 73054 | |
| / | 19087 | 4.8% |
| ' | 7147 | 1.8% |
| : | 4428 | 1.1% |
| # | 4037 | 1.0% |
| " | 1994 | 0.5% |
| ? | 703 | 0.2% |
| & | 599 | 0.2% |
| Other values (5) | 73 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 125210 | |
| 0 | 82093 | |
| 2 | 69469 | |
| 5 | 50957 | |
| 3 | 50931 | |
| 4 | 49415 | 9.0% |
| 6 | 36615 | 6.6% |
| 7 | 31244 | 5.7% |
| 8 | 27594 | 5.0% |
| 9 | 27116 | 4.9% |
Math Symbol
| Value | Count | Frequency (%) |
| | | 22235 | |
| + | 2928 | 11.2% |
| = | 1045 | 4.0% |
| ± | 36 | 0.1% |
| ~ | 8 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 37729 | |
| { | 2081 | 5.1% |
| [ | 626 | 1.5% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 37422 | |
| } | 2082 | 5.2% |
| ] | 626 | 1.6% |
Currency Symbol
| Value | Count | Frequency (%) |
| $ | 3 | |
| € | 2 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 1556089 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 53241 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 35 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7123035 | |
| Common | 2661419 | 27.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 696361 | 9.8% |
| a | 667574 | 9.4% |
| o | 563183 | 7.9% |
| n | 459218 | 6.4% |
| t | 454511 | 6.4% |
| r | 411334 | 5.8% |
| i | 400897 | 5.6% |
| l | 325764 | 4.6% |
| s | 321111 | 4.5% |
| f | 214145 | 3.0% |
| Other values (48) | 2608937 |
Common
| Value | Count | Frequency (%) |
| (space) | 1556089 | |
| , | 179506 | 6.7% |
| 1 | 125210 | 4.7% |
| . | 103955 | 3.9% |
| 0 | 82093 | 3.1% |
| ; | 73054 | 2.7% |
| 2 | 69469 | 2.6% |
| - | 53241 | 2.0% |
| 5 | 50957 | 1.9% |
| 3 | 50931 | 1.9% |
| Other values (32) | 316914 | 11.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9784239 | |
| None | 213 | < 0.1% |
| Currency Symbols | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 1556089 | 15.9% |
| e | 696361 | 7.1% |
| a | 667574 | 6.8% |
| o | 563183 | 5.8% |
| n | 459218 | 4.7% |
| t | 454511 | 4.6% |
| r | 411334 | 4.2% |
| i | 400897 | 4.1% |
| l | 325764 | 3.3% |
| s | 321111 | 3.3% |
| Other values (81) | 3928197 |
None
| Value | Count | Frequency (%) |
| ñ | 93 | |
| ± | 36 | 16.9% |
| Ã | 36 | 16.9% |
| í | 27 | 12.7% |
| á | 14 | 6.6% |
| ° | 4 | 1.9% |
| é | 2 | 0.9% |
| ö | 1 | 0.5% |
Currency Symbols
| Value | Count | Frequency (%) |
| € | 2 |
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | 3.6% |
| Missing | 724311 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 88 |
|---|---|
| Median length | 88 |
| Mean length | 81.14720812 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | 1.0% |
Sample
| 1st row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
|---|---|
| 2nd row | Approx.450-500ft Above Base Of Fm |
| 3rd row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
| 4th row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
| 5th row | Elevation for Rampart Cave derived from Google Earth by Dr. Jim Mead on 4 Decemeber 2023 |
| Value | Count | Frequency (%) |
| elevation | 161 | 5.5% |
| by | 161 | 5.5% |
| 2023 | 161 | 5.5% |
| decemeber | 161 | 5.5% |
| 4 | 161 | 5.5% |
| mead | 161 | 5.5% |
| jim | 161 | 5.5% |
| dr | 161 | 5.5% |
| on | 161 | 5.5% |
| earth | 161 | 5.5% |
| Other values (38) | 1300 |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 2713 | |
| e | 1696 | 10.6% |
| r | 1185 | 7.4% |
| o | 1092 | 6.8% |
| a | 1023 | 6.4% |
| m | 656 | 4.1% |
| t | 562 | 3.5% |
| v | 533 | 3.3% |
| i | 527 | 3.3% |
| d | 497 | 3.1% |
| Other values (45) | 5502 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10285 | |
| Space Separator | 2713 | 17.0% |
| Uppercase Letter | 1740 | 10.9% |
| Decimal Number | 968 | 6.1% |
| Other Punctuation | 239 | 1.5% |
| Math Symbol | 29 | 0.2% |
| Dash Punctuation | 12 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1696 | |
| r | 1185 | |
| o | 1092 | |
| a | 1023 | |
| m | 656 | 6.4% |
| t | 562 | 5.5% |
| v | 533 | 5.2% |
| i | 527 | 5.1% |
| d | 497 | 4.8% |
| n | 407 | 4.0% |
| Other values (13) | 2107 |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 322 | |
| E | 322 | |
| C | 194 | |
| M | 185 | |
| J | 161 | |
| G | 161 | |
| R | 161 | |
| A | 64 | 3.7% |
| B | 53 | 3.0% |
| O | 25 | 1.4% |
| Other values (8) | 92 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 354 | |
| 0 | 209 | |
| 4 | 173 | |
| 3 | 161 | |
| 5 | 40 | 4.1% |
| 1 | 25 | 2.6% |
| 6 | 5 | 0.5% |
| 8 | 1 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 196 | |
| , | 42 | 17.6% |
| / | 1 | 0.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2713 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 29 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12025 | |
| Common | 3961 | 24.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1696 | |
| r | 1185 | 9.9% |
| o | 1092 | 9.1% |
| a | 1023 | 8.5% |
| m | 656 | 5.5% |
| t | 562 | 4.7% |
| v | 533 | 4.4% |
| i | 527 | 4.4% |
| d | 497 | 4.1% |
| n | 407 | 3.4% |
| Other values (31) | 3847 |
Common
| Value | Count | Frequency (%) |
| (space) | 2713 | |
| 2 | 354 | 8.9% |
| 0 | 209 | 5.3% |
| . | 196 | 4.9% |
| 4 | 173 | 4.4% |
| 3 | 161 | 4.1% |
| , | 42 | 1.1% |
| 5 | 40 | 1.0% |
| + | 29 | 0.7% |
| 1 | 25 | 0.6% |
| Other values (4) | 19 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 15986 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 2713 | |
| e | 1696 | 10.6% |
| r | 1185 | 7.4% |
| o | 1092 | 6.8% |
| a | 1023 | 6.4% |
| m | 656 | 4.1% |
| t | 562 | 3.5% |
| v | 533 | 3.3% |
| i | 527 | 3.3% |
| d | 497 | 3.1% |
| Other values (45) | 5502 |
verbatimDepth
Text
Missing 
| Distinct | 17 |
|---|---|
| Distinct (%) | 20.2% |
| Missing | 724424 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 10 |
| Mean length | 5.523809524 |
| Min length | 4 |
Unique
| Unique | 9 |
|---|---|
| Unique (%) | 10.7% |
Sample
| 1st row | reef |
|---|---|
| 2nd row | Beach |
| 3rd row | ?48 Ms |
| 4th row | Beach |
| 5th row | Intertidal |
| Value | Count | Frequency (%) |
| reef | 30 | |
| beach | 25 | |
| low | 9 | 8.3% |
| ms | 8 | 7.3% |
| water | 7 | 6.4% |
| 48 | 6 | 5.5% |
| no.4 | 4 | 3.7% |
| mnb | 3 | 2.8% |
| 57ms | 2 | 1.8% |
| 25 | 2 | 1.8% |
| Other values (12) | 13 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | 8.6% |
| a | 37 | 8.0% |
| f | 31 | 6.7% |
| c | 26 | 5.6% |
| h | 25 | 5.4% |
| (space) | 25 | 5.4% |
| b | 18 | 3.9% |
| o | 13 | 2.8% |
| t | 13 | 2.8% |
| Other values (30) | 140 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 339 | |
| Uppercase Letter | 51 | 11.0% |
| Decimal Number | 32 | 6.9% |
| Space Separator | 25 | 5.4% |
| Other Punctuation | 16 | 3.4% |
| Dash Punctuation | 1 | 0.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | |
| a | 37 | 10.9% |
| f | 31 | 9.1% |
| c | 26 | 7.7% |
| h | 25 | 7.4% |
| b | 18 | 5.3% |
| o | 13 | 3.8% |
| t | 13 | 3.8% |
| s | 10 | 2.9% |
| Other values (7) | 30 | 8.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 12 | |
| B | 10 | |
| L | 9 | |
| W | 8 | |
| N | 4 | 7.8% |
| F | 2 | 3.9% |
| A | 1 | 2.0% |
| S | 1 | 2.0% |
| U | 1 | 2.0% |
| C | 1 | 2.0% |
| Other values (2) | 2 | 3.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 11 | |
| 8 | 8 | |
| 5 | 4 | 12.5% |
| 7 | 3 | 9.4% |
| 0 | 3 | 9.4% |
| 2 | 2 | 6.2% |
| 3 | 1 | 3.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10 | |
| ? | 6 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 25 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 390 | |
| Common | 74 | 15.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | |
| a | 37 | 9.5% |
| f | 31 | 7.9% |
| c | 26 | 6.7% |
| h | 25 | 6.4% |
| b | 18 | 4.6% |
| o | 13 | 3.3% |
| t | 13 | 3.3% |
| M | 12 | 3.1% |
| Other values (19) | 79 |
Common
| Value | Count | Frequency (%) |
| (space) | 25 | |
| 4 | 11 | |
| . | 10 | 13.5% |
| 8 | 8 | 10.8% |
| ? | 6 | 8.1% |
| 5 | 4 | 5.4% |
| 7 | 3 | 4.1% |
| 0 | 3 | 4.1% |
| 2 | 2 | 2.7% |
| - | 1 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 464 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 96 | |
| r | 40 | 8.6% |
| a | 37 | 8.0% |
| f | 31 | 6.7% |
| c | 26 | 5.6% |
| h | 25 | 5.4% |
| (space) | 25 | 5.4% |
| b | 18 | 3.9% |
| o | 13 | 2.8% |
| t | 13 | 2.8% |
| Other values (30) | 140 |
decimalLatitude
Text
Missing 
| Distinct | 34307 |
|---|---|
| Distinct (%) | 33.0% |
| Missing | 620569 |
| Missing (%) | 85.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.719883778 |
| Min length | 3 |
Unique
| Unique | 19066 |
|---|---|
| Unique (%) | 18.3% |
Sample
| 1st row | 30.1564 |
|---|---|
| 2nd row | 36.9858 |
| 3rd row | 31.9911 |
| 4th row | 69.08 |
| 5th row | 17.8883 |
| Value | Count | Frequency (%) |
| 44.6458 | 1686 | 1.6% |
| 17.5 | 673 | 0.6% |
| 29.8119 | 329 | 0.3% |
| 33.1767 | 323 | 0.3% |
| 34.6405 | 307 | 0.3% |
| 38.8295 | 287 | 0.3% |
| 41.1458 | 279 | 0.3% |
| 48.1104 | 243 | 0.2% |
| 40.6184 | 235 | 0.2% |
| 31.6767 | 227 | 0.2% |
| Other values (34049) | 99350 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 103939 | |
| 3 | 93842 | |
| 4 | 66308 | |
| 5 | 65933 | |
| 8 | 57884 | |
| 1 | 55433 | |
| 7 | 55155 | |
| 6 | 54645 | |
| 2 | 54452 | |
| 9 | 45816 | |
| Other values (2) | 45051 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 588796 | |
| Other Punctuation | 103939 | 14.9% |
| Dash Punctuation | 5723 | 0.8% |
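The first row of this table lost its percentage in extraction, but because the rows cover every character in the column, the missing figure can be recovered from the counts. A minimal sketch follows, assuming the report rounds to one decimal place; `frequency_percent` is a hypothetical helper name.

```python
def frequency_percent(counts):
    """Return each count as a percentage of the total, rounded to 0.1."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}

# Counts copied from the decimalLatitude category table above.
lat_categories = {
    "Decimal Number": 588796,
    "Other Punctuation": 103939,
    "Dash Punctuation": 5723,
}
percentages = frequency_percent(lat_categories)
print(percentages)
```

The values computed for Other Punctuation and Dash Punctuation can be checked against the 14.9% and 0.8% printed in the table, which gives some confidence in the recovered Decimal Number share.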
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 93842 | |
| 4 | 66308 | |
| 5 | 65933 | |
| 8 | 57884 | |
| 1 | 55433 | |
| 7 | 55155 | |
| 6 | 54645 | |
| 2 | 54452 | |
| 9 | 45816 | |
| 0 | 39328 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 103939 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 5723 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 698458 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 103939 | |
| 3 | 93842 | |
| 4 | 66308 | |
| 5 | 65933 | |
| 8 | 57884 | |
| 1 | 55433 | |
| 7 | 55155 | |
| 6 | 54645 | |
| 2 | 54452 | |
| 9 | 45816 | |
| Other values (2) | 45051 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 698458 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 103939 | |
| 3 | 93842 | |
| 4 | 66308 | |
| 5 | 65933 | |
| 8 | 57884 | |
| 1 | 55433 | |
| 7 | 55155 | |
| 6 | 54645 | |
| 2 | 54452 | |
| 9 | 45816 | |
| Other values (2) | 45051 |
decimalLongitude
Text
Missing 
| Distinct | 35344 |
|---|---|
| Distinct (%) | 34.0% |
| Missing | 620569 |
| Missing (%) | 85.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 8 |
| Mean length | 7.641020214 |
| Min length | 3 |
Unique
| Unique | 19861 |
|---|---|
| Unique (%) | 19.1% |
Sample
| 1st row | -85.6439 |
|---|---|
| 2nd row | -114.996 |
| 3rd row | -80.7842 |
| 4th row | -155.83 |
| 5th row | -66.52 |
| Value | Count | Frequency (%) |
| 123.908 | 1686 | 1.6% |
| 95.0833 | 673 | 0.6% |
| 103.252 | 329 | 0.3% |
| 98.6878 | 321 | 0.3% |
| 105.851 | 307 | 0.3% |
| 76.8473 | 287 | 0.3% |
| 115.358 | 279 | 0.3% |
| 123.934 | 243 | 0.2% |
| 108.207 | 235 | 0.2% |
| 123.18 | 230 | 0.2% |
| Other values (35142) | 99349 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 103939 | |
| - | 95620 | |
| 1 | 88364 | |
| 7 | 72540 | |
| 8 | 71709 | |
| 3 | 62429 | |
| 6 | 55880 | |
| 5 | 55457 | |
| 2 | 52919 | |
| 9 | 50099 | |
| Other values (2) | 85244 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 594641 | |
| Other Punctuation | 103939 | 13.1% |
| Dash Punctuation | 95620 | 12.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 88364 | |
| 7 | 72540 | |
| 8 | 71709 | |
| 3 | 62429 | |
| 6 | 55880 | |
| 5 | 55457 | |
| 2 | 52919 | |
| 9 | 50099 | |
| 4 | 45122 | |
| 0 | 40122 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 103939 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 95620 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 794200 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 103939 | |
| - | 95620 | |
| 1 | 88364 | |
| 7 | 72540 | |
| 8 | 71709 | |
| 3 | 62429 | |
| 6 | 55880 | |
| 5 | 55457 | |
| 2 | 52919 | |
| 9 | 50099 | |
| Other values (2) | 85244 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 794200 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 103939 | |
| - | 95620 | |
| 1 | 88364 | |
| 7 | 72540 | |
| 8 | 71709 | |
| 3 | 62429 | |
| 6 | 55880 | |
| 5 | 55457 | |
| 2 | 52919 | |
| 9 | 50099 | |
| Other values (2) | 85244 |
geodeticDatum
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 698201 |
| Missing (%) | 96.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 18 |
| Mean length | 17.69483407 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | WGS 84 (EPSG:4326) |
|---|---|
| 2nd row | WGS 84 (EPSG:4326) |
| 3rd row | WGS 84 (EPSG:4326) |
| 4th row | WGS 84 (EPSG:4326) |
| 5th row | WGS 84 (EPSG:4326) |
| Value | Count | Frequency (%) |
| wgs | 24628 | |
| 84 | 24628 | |
| epsg:4326 | 24628 | |
| nad27 | 561 | 0.7% |
| epsg:4267 | 561 | 0.7% |
| nad83 | 474 | 0.6% |
| epsg:4269 | 474 | 0.6% |
| wgs84 | 447 | 0.6% |
| not | 197 | 0.3% |
| recorded | 197 | 0.3% |
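The token counts above suggest a mix of datum spellings: most values carry an explicit EPSG tag, some are a bare `WGS84`, and some are `not recorded`. As a rough sketch of how these could be normalised to a single code before any coordinate work, the function below extracts the EPSG number when present and otherwise falls back to a small lookup. The name `epsg_code` and the lookup table are assumptions for illustration, and the full strings other than the `WGS 84 (EPSG:4326)` sample are inferred from the token counts rather than shown in the report.

```python
import re

# Fallback codes for datum names that appear without an explicit EPSG tag.
# These are the standard EPSG codes for the geographic CRS of each datum.
DATUM_TO_EPSG = {"WGS84": 4326, "NAD27": 4267, "NAD83": 4269}

EPSG_PATTERN = re.compile(r"EPSG:(\d+)")

def epsg_code(datum):
    """Return the EPSG code for a geodeticDatum string, or None if unknown."""
    match = EPSG_PATTERN.search(datum)
    if match:
        return int(match.group(1))
    # Normalise "WGS 84" / "wgs84" style spellings before the lookup.
    key = datum.replace(" ", "").upper()
    return DATUM_TO_EPSG.get(key)

for value in ["WGS 84 (EPSG:4326)", "NAD27 (EPSG:4267)", "WGS84", "not recorded"]:
    print(value, "->", epsg_code(value))
```

Values that resolve to `None`, such as `not recorded`, would need to be handled separately, since no datum can be assumed for them.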
Most occurring characters
| Value | Count | Frequency (%) |
| G | 50738 | |
| S | 50738 | |
| 4 | 50738 | |
| (space) | 50488 | |
| 2 | 26224 | 5.6% |
| ) | 25663 | 5.5% |
| ( | 25663 | 5.5% |
| E | 25663 | 5.5% |
| P | 25663 | 5.5% |
| : | 25663 | 5.5% |
| Other values (16) | 108257 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 180982 | |
| Decimal Number | 154872 | |
| Space Separator | 50488 | 10.8% |
| Close Punctuation | 25663 | 5.5% |
| Open Punctuation | 25663 | 5.5% |
| Other Punctuation | 25663 | 5.5% |
| Lowercase Letter | 2167 | 0.5% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 50738 | |
| S | 50738 | |
| E | 25663 | |
| P | 25663 | |
| W | 25075 | |
| N | 1035 | 0.6% |
| A | 1035 | 0.6% |
| D | 1035 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 50738 | |
| 2 | 26224 | |
| 6 | 25663 | |
| 8 | 25549 | |
| 3 | 25102 | |
| 7 | 1122 | 0.7% |
| 9 | 474 | 0.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 394 | |
| r | 394 | |
| e | 394 | |
| d | 394 | |
| n | 197 | |
| t | 197 | |
| c | 197 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 50488 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 25663 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 25663 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 25663 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 282349 | |
| Latin | 183149 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| G | 50738 | |
| S | 50738 | |
| E | 25663 | |
| P | 25663 | |
| W | 25075 | |
| N | 1035 | 0.6% |
| A | 1035 | 0.6% |
| D | 1035 | 0.6% |
| o | 394 | 0.2% |
| r | 394 | 0.2% |
| Other values (5) | 1379 | 0.8% |
Common
| Value | Count | Frequency (%) |
| 4 | 50738 | |
| (space) | 50488 | |
| 2 | 26224 | |
| ) | 25663 | |
| ( | 25663 | |
| : | 25663 | |
| 6 | 25663 | |
| 8 | 25549 | |
| 3 | 25102 | |
| 7 | 1122 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 465498 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| G | 50738 | |
| S | 50738 | |
| 4 | 50738 | |
| (space) | 50488 | |
| 2 | 26224 | 5.6% |
| ) | 25663 | 5.5% |
| ( | 25663 | 5.5% |
| E | 25663 | 5.5% |
| P | 25663 | 5.5% |
| : | 25663 | 5.5% |
| Other values (16) | 108257 |
verbatimLatitude
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 40.0% |
| Missing | 724503 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.4 |
| Min length | 9 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 11 53.4 N |
|---|---|
| 2nd row | 11 53.4 N |
| 3rd row | 11 53.4 N |
| 4th row | 18 44.98 N |
| 5th row | 18 44.98 N |
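The sample values above look like degrees followed by decimal minutes and a hemisphere letter. A minimal sketch of converting that layout to decimal degrees is given below; it assumes exactly three whitespace-separated parts, and `dm_to_decimal` is a hypothetical helper name.

```python
def dm_to_decimal(text):
    """Convert 'DD MM.mm H' (degrees, decimal minutes, hemisphere) to decimal degrees."""
    degrees, minutes, hemisphere = text.split()
    value = int(degrees) + float(minutes) / 60
    # South and West are negative by convention.
    if hemisphere.upper() in ("S", "W"):
        value = -value
    return round(value, 6)

for raw in ["11 53.4 N", "18 44.98 N"]:
    print(raw, "->", dm_to_decimal(raw))
```

Values in any other layout (for example true degrees, minutes, and seconds) would raise a `ValueError` on unpacking, which is a deliberate choice here so that unexpected formats are not silently misread.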
| Value | Count | Frequency (%) |
| n | 5 | |
| 11 | 3 | |
| 53.4 | 3 | |
| 18 | 2 | 13.3% |
| 44.98 | 2 | 13.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 1 | 8 | |
| 4 | 7 | |
| . | 5 | |
| N | 5 | |
| 8 | 4 | 8.5% |
| 5 | 3 | 6.4% |
| 3 | 3 | 6.4% |
| 9 | 2 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 27 | |
| Space Separator | 10 | 21.3% |
| Other Punctuation | 5 | 10.6% |
| Uppercase Letter | 5 | 10.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 4 | 7 | |
| 8 | 4 | |
| 5 | 3 | 11.1% |
| 3 | 3 | 11.1% |
| 9 | 2 | 7.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42 | |
| Latin | 5 | 10.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 1 | 8 | |
| 4 | 7 | |
| . | 5 | |
| 8 | 4 | 9.5% |
| 5 | 3 | 7.1% |
| 3 | 3 | 7.1% |
| 9 | 2 | 4.8% |
Latin
| Value | Count | Frequency (%) |
| N | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 1 | 8 | |
| 4 | 7 | |
| . | 5 | |
| N | 5 | |
| 8 | 4 | 8.5% |
| 5 | 3 | 6.4% |
| 3 | 3 | 6.4% |
| 9 | 2 | 4.3% |
verbatimLongitude
Text
Missing
| Distinct | 2 |
|---|---|
| Distinct (%) | 40.0% |
| Missing | 724503 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.4 |
| Min length | 9 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 48 14.7 E |
|---|---|
| 2nd row | 48 14.7 E |
| 3rd row | 48 14.7 E |
| 4th row | 60 07.78 E |
| 5th row | 60 07.78 E |
| Value | Count | Frequency (%) |
| e | 5 | |
| 48 | 3 | |
| 14.7 | 3 | |
| 60 | 2 | 13.3% |
| 07.78 | 2 | 13.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 7 | 7 | |
| 4 | 6 | |
| 8 | 5 | |
| . | 5 | |
| E | 5 | |
| 0 | 4 | 8.5% |
| 1 | 3 | 6.4% |
| 6 | 2 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 27 | |
| Space Separator | 10 | 21.3% |
| Other Punctuation | 5 | 10.6% |
| Uppercase Letter | 5 | 10.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 7 | |
| 4 | 6 | |
| 8 | 5 | |
| 0 | 4 | |
| 1 | 3 | |
| 6 | 2 | 7.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 10 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 42 | |
| Latin | 5 | 10.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 7 | 7 | |
| 4 | 6 | |
| 8 | 5 | |
| . | 5 | |
| 0 | 4 | 9.5% |
| 1 | 3 | 7.1% |
| 6 | 2 | 4.8% |
Latin
| Value | Count | Frequency (%) |
| E | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 10 | |
| 7 | 7 | |
| 4 | 6 | |
| 8 | 5 | |
| . | 5 | |
| E | 5 | |
| 0 | 4 | 8.5% |
| 1 | 3 | 6.4% |
| 6 | 2 | 4.3% |
verbatimCoordinateSystem
Text
Constant  Missing
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 654265 |
| Missing (%) | 90.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 23 |
| Min length | 23 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 70243 | |
| minutes | 70243 | |
| seconds | 70243 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| (space) | 140486 | 8.7% |
| n | 140486 | 8.7% |
| D | 70243 | 4.3% |
| g | 70243 | 4.3% |
| r | 70243 | 4.3% |
| M | 70243 | 4.3% |
| i | 70243 | 4.3% |
| u | 70243 | 4.3% |
| Other values (5) | 351215 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1264374 | |
| Uppercase Letter | 210729 | 13.0% |
| Space Separator | 140486 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| n | 140486 | 11.1% |
| g | 70243 | 5.6% |
| r | 70243 | 5.6% |
| i | 70243 | 5.6% |
| u | 70243 | 5.6% |
| t | 70243 | 5.6% |
| c | 70243 | 5.6% |
| o | 70243 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 70243 | |
| M | 70243 | |
| S | 70243 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 140486 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1475103 | |
| Common | 140486 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| n | 140486 | 9.5% |
| D | 70243 | 4.8% |
| g | 70243 | 4.8% |
| r | 70243 | 4.8% |
| M | 70243 | 4.8% |
| i | 70243 | 4.8% |
| u | 70243 | 4.8% |
| t | 70243 | 4.8% |
| Other values (4) | 280972 |
Common
| Value | Count | Frequency (%) |
| (space) | 140486 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1615589 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 351215 | |
| s | 210729 | |
| (space) | 140486 | 8.7% |
| n | 140486 | 8.7% |
| D | 70243 | 4.3% |
| g | 70243 | 4.3% |
| r | 70243 | 4.3% |
| M | 70243 | 4.3% |
| i | 70243 | 4.3% |
| u | 70243 | 4.3% |
| Other values (5) | 351215 |
Missing 
| Distinct | 19 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 695012 |
| Missing (%) | 95.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 81 |
|---|---|
| Median length | 43 |
| Mean length | 42.23633713 |
| Min length | 7 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Georeferencing Quick Reference Guide (2020) |
|---|---|
| 2nd row | Georeferencing Quick Reference Guide (2020) |
| 3rd row | Georeferencing Quick Reference Guide (2020) |
| 4th row | Georeferencing Quick Reference Guide (2020) |
| 5th row | Georeferencing Quick Reference Guide (2020) |
| Value | Count | Frequency (%) |
| georeferencing | 26344 | 17.6% |
| guide | 26344 | 17.6% |
| reference | 24178 | 16.2% |
| 2020 | 24178 | 16.2% |
| quick | 24178 | 16.2% |
| biogeomancer | 2166 | 1.4% |
| 2006 | 2166 | 1.4% |
| august | 2166 | 1.4% |
| consortium | 2166 | 1.4% |
| for | 2166 | 1.4% |
| Other values (32) | 13421 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 237471 | 19.1% |
| (space) | 119977 | 9.6% |
| r | 87730 | 7.0% |
| i | 84069 | 6.7% |
| n | 82720 | 6.6% |
| c | 81302 | 6.5% |
| u | 58822 | 4.7% |
| G | 54854 | 4.4% |
| 0 | 52731 | 4.2% |
| f | 52688 | 4.2% |
| Other values (40) | 333439 | 26.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 844245 | 67.8% |
| Uppercase Letter | 121633 | 9.8% |
| Space Separator | 119977 | 9.6% |
| Decimal Number | 105634 | 8.5% |
| Open Punctuation | 24178 | 1.9% |
| Close Punctuation | 24178 | 1.9% |
| Other Punctuation | 5915 | 0.5% |
| Math Symbol | 43 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 237471 | 28.1% |
| r | 87730 | 10.4% |
| i | 84069 | 10.0% |
| n | 82720 | 9.8% |
| c | 81302 | 9.6% |
| u | 58822 | 7.0% |
| f | 52688 | 6.2% |
| o | 40962 | 4.9% |
| g | 28625 | 3.4% |
| d | 28111 | 3.3% |
| Other values (12) | 61745 | 7.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 54854 | 45.1% |
| Q | 25508 | 21.0% |
| R | 24645 | 20.3% |
| B | 4332 | 3.6% |
| A | 3450 | 2.8% |
| C | 2537 | 2.1% |
| P | 2195 | 1.8% |
| M | 1338 | 1.1% |
| L | 1299 | 1.1% |
| V | 351 | 0.3% |
| Other values (6) | 1124 | 0.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 52731 | 49.9% |
| 2 | 50522 | 47.8% |
| 6 | 2166 | 2.1% |
| 5 | 129 | 0.1% |
| 4 | 43 | < 0.1% |
| 8 | 43 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3205 | 54.2% |
| , | 2710 | 45.8% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 119977 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 24178 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 24178 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 43 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 965878 | 77.5% |
| Common | 279925 | 22.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 237471 | 24.6% |
| r | 87730 | 9.1% |
| i | 84069 | 8.7% |
| n | 82720 | 8.6% |
| c | 81302 | 8.4% |
| u | 58822 | 6.1% |
| G | 54854 | 5.7% |
| f | 52688 | 5.5% |
| o | 40962 | 4.2% |
| g | 28625 | 3.0% |
| Other values (28) | 156635 | 16.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 119977 | 42.9% |
| 0 | 52731 | 18.8% |
| 2 | 50522 | 18.0% |
| ( | 24178 | 8.6% |
| ) | 24178 | 8.6% |
| . | 3205 | 1.1% |
| , | 2710 | 1.0% |
| 6 | 2166 | 0.8% |
| 5 | 129 | < 0.1% |
| 4 | 43 | < 0.1% |
| Other values (2) | 86 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1245803 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 237471 | 19.1% |
| (space) | 119977 | 9.6% |
| r | 87730 | 7.0% |
| i | 84069 | 6.7% |
| n | 82720 | 6.6% |
| c | 81302 | 6.5% |
| u | 58822 | 4.7% |
| G | 54854 | 4.4% |
| 0 | 52731 | 4.2% |
| f | 52688 | 4.2% |
| Other values (40) | 333439 | 26.8% |
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 40.0% |
| Missing | 724503 |
| Missing (%) | > 99.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 70 |
|---|---|
| Median length | 70 |
| Mean length | 58 |
| Min length | 10 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | 20.0% |
Sample
| 1st row | A; B; C; D |
|---|---|
| 2nd row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| 3rd row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| 4th row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| 5th row | included in Jennifer Jett's Foram Bulk DB but not included in F Ledger |
| Value | Count | Frequency (%) |
| included | 8 | 14.3% |
| in | 8 | 14.3% |
| jennifer | 4 | 7.1% |
| jett's | 4 | 7.1% |
| foram | 4 | 7.1% |
| bulk | 4 | 7.1% |
| db | 4 | 7.1% |
| but | 4 | 7.1% |
| not | 4 | 7.1% |
| f | 4 | 7.1% |
| Other values (5) | 8 | 14.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 51 | 17.6% |
| n | 28 | 9.7% |
| e | 28 | 9.7% |
| i | 20 | 6.9% |
| d | 20 | 6.9% |
| u | 16 | 5.5% |
| t | 16 | 5.5% |
| r | 12 | 4.1% |
| l | 12 | 4.1% |
| B | 9 | 3.1% |
| Other values (17) | 78 | 26.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 196 | 67.6% |
| Space Separator | 51 | 17.6% |
| Uppercase Letter | 36 | 12.4% |
| Other Punctuation | 7 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 28 | 14.3% |
| e | 28 | 14.3% |
| i | 20 | 10.2% |
| d | 20 | 10.2% |
| u | 16 | 8.2% |
| t | 16 | 8.2% |
| r | 12 | 6.1% |
| l | 12 | 6.1% |
| c | 8 | 4.1% |
| o | 8 | 4.1% |
| Other values (7) | 28 | 14.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| B | 9 | 25.0% |
| J | 8 | 22.2% |
| F | 8 | 22.2% |
| D | 5 | 13.9% |
| L | 4 | 11.1% |
| A | 1 | 2.8% |
| C | 1 | 2.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 4 | 57.1% |
| ; | 3 | 42.9% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 51 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 232 | 80.0% |
| Common | 58 | 20.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 28 | 12.1% |
| e | 28 | 12.1% |
| i | 20 | 8.6% |
| d | 20 | 8.6% |
| u | 16 | 6.9% |
| t | 16 | 6.9% |
| r | 12 | 5.2% |
| l | 12 | 5.2% |
| B | 9 | 3.9% |
| J | 8 | 3.4% |
| Other values (14) | 63 | 27.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 51 | 87.9% |
| ' | 4 | 6.9% |
| ; | 3 | 5.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 290 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 51 | 17.6% |
| n | 28 | 9.7% |
| e | 28 | 9.7% |
| i | 20 | 6.9% |
| d | 20 | 6.9% |
| u | 16 | 5.5% |
| t | 16 | 5.5% |
| r | 12 | 4.1% |
| l | 12 | 4.1% |
| B | 9 | 3.1% |
| Other values (17) | 78 | 26.9% |
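The summary figures for this variable (Distinct, Missing, Unique, Length) can be reproduced with a few lines of standard-library Python. The sketch below is an illustration, not the profiler's own code: the function name `summarize_text_column` is hypothetical, and the percentage conventions (distinct and unique relative to non-missing values, missing relative to all rows) are an assumption inferred from the figures in this report. The input uses the five non-missing values sampled above plus five placeholder missing rows.

```python
from collections import Counter
from statistics import mean, median


def summarize_text_column(values):
    """Summarize a text column given as a list of str or None (None = missing)."""
    present = [v for v in values if v is not None]
    if not present:
        raise ValueError("column has no non-missing values")
    occurrences = Counter(present)
    lengths = [len(v) for v in present]
    # A value is "unique" when it occurs exactly once.
    n_unique = sum(1 for n in occurrences.values() if n == 1)
    n_missing = len(values) - len(present)
    return {
        "distinct": len(occurrences),
        # Assumption: distinct/unique percentages are relative to non-missing values.
        "distinct_pct": 100 * len(occurrences) / len(present),
        "missing": n_missing,
        "missing_pct": 100 * n_missing / len(values),
        "unique": n_unique,
        "unique_pct": 100 * n_unique / len(present),
        "max_length": max(lengths),
        "median_length": median(lengths),
        "mean_length": mean(lengths),
        "min_length": min(lengths),
    }


remark = "included in Jennifer Jett's Foram Bulk DB but not included in F Ledger"
# Five sampled values plus five placeholder missing rows (not the real missing count).
column = ["A; B; C; D"] + [remark] * 4 + [None] * 5
summary = summarize_text_column(column)
print(summary)
```

With this input the sketch gives 2 distinct values (40.0%), 1 unique value (20.0%), and lengths of 10 (min), 70 (max) and 58 (mean), in line with the tables for this variable. The missing count differs because only five placeholder missing rows are used.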
earliestEraOrLowestErathem
Text
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 220036 |
| Missing (%) | 30.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 8 |
| Mean length | 8.387123567 |
| Min length | 8 |
Unique
| Unique | 2 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mesozoic |
|---|---|
| 2nd row | Cenozoic |
| 3rd row | Cenozoic |
| 4th row | Paleozoic |
| 5th row | Cenozoic |
| Value | Count | Frequency (%) |
| cenozoic | 261752 | 51.9% |
| paleozoic | 194023 | 38.5% |
| mesozoic | 48343 | 9.6% |
| precambrian | 298 | 0.1% |
| mesoproterozoic | 41 | < 0.1% |
| neoproterozoic | 7 | < 0.1% |
| paleoproterozoic | 4 | < 0.1% |
| paleoarchean | 3 | < 0.1% |
| mesoarchean | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 1008448 | 23.8% |
| e | 504528 | 11.9% |
| c | 504472 | 11.9% |
| i | 504468 | 11.9% |
| z | 504170 | 11.9% |
| n | 262054 | 6.2% |
| C | 261752 | 6.2% |
| a | 194634 | 4.6% |
| P | 194327 | 4.6% |
| l | 194030 | 4.6% |
| Other values (9) | 98186 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3726598 | 88.1% |
| Uppercase Letter | 504471 | 11.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1008448 | 27.1% |
| e | 504528 | 13.5% |
| c | 504472 | 13.5% |
| i | 504468 | 13.5% |
| z | 504170 | 13.5% |
| n | 262054 | 7.0% |
| a | 194634 | 5.2% |
| l | 194030 | 5.2% |
| s | 48385 | 1.3% |
| r | 704 | < 0.1% |
| Other values (5) | 705 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 261752 | 51.9% |
| P | 194327 | 38.5% |
| M | 48385 | 9.6% |
| N | 7 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4231069 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 1008448 | 23.8% |
| e | 504528 | 11.9% |
| c | 504472 | 11.9% |
| i | 504468 | 11.9% |
| z | 504170 | 11.9% |
| n | 262054 | 6.2% |
| C | 261752 | 6.2% |
| a | 194634 | 4.6% |
| P | 194327 | 4.6% |
| l | 194030 | 4.6% |
| Other values (9) | 98186 | 2.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4231069 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 1008448 | 23.8% |
| e | 504528 | 11.9% |
| c | 504472 | 11.9% |
| i | 504468 | 11.9% |
| z | 504170 | 11.9% |
| n | 262054 | 6.2% |
| C | 261752 | 6.2% |
| a | 194634 | 4.6% |
| P | 194327 | 4.6% |
| l | 194030 | 4.6% |
| Other values (9) | 98186 | 2.3% |
latestEraOrHighestErathem
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 718163 |
| Missing (%) | 99.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 8 |
| Mean length | 8.134121355 |
| Min length | 8 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Paleozoic |
|---|---|
| 2nd row | Cenozoic |
| 3rd row | Mesozoic |
| 4th row | Cenozoic |
| 5th row | Cenozoic |
| Value | Count | Frequency (%) |
| cenozoic | 5229 | 82.4% |
| paleozoic | 826 | 13.0% |
| mesozoic | 286 | 4.5% |
| neoproterozoic | 3 | < 0.1% |
| mesoproterozoic | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 12698 | 24.6% |
| e | 6349 | 12.3% |
| z | 6345 | 12.3% |
| i | 6345 | 12.3% |
| c | 6345 | 12.3% |
| C | 5229 | 10.1% |
| n | 5229 | 10.1% |
| P | 826 | 1.6% |
| a | 826 | 1.6% |
| l | 826 | 1.6% |
| Other values (6) | 593 | 1.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45266 | 87.7% |
| Uppercase Letter | 6345 | 12.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 12698 | 28.1% |
| e | 6349 | 14.0% |
| z | 6345 | 14.0% |
| i | 6345 | 14.0% |
| c | 6345 | 14.0% |
| n | 5229 | 11.6% |
| a | 826 | 1.8% |
| l | 826 | 1.8% |
| s | 287 | 0.6% |
| r | 8 | < 0.1% |
| Other values (2) | 8 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 5229 | 82.4% |
| P | 826 | 13.0% |
| M | 287 | 4.5% |
| N | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51611 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 12698 | 24.6% |
| e | 6349 | 12.3% |
| z | 6345 | 12.3% |
| i | 6345 | 12.3% |
| c | 6345 | 12.3% |
| C | 5229 | 10.1% |
| n | 5229 | 10.1% |
| P | 826 | 1.6% |
| a | 826 | 1.6% |
| l | 826 | 1.6% |
| Other values (6) | 593 | 1.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51611 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 12698 | 24.6% |
| e | 6349 | 12.3% |
| z | 6345 | 12.3% |
| i | 6345 | 12.3% |
| c | 6345 | 12.3% |
| C | 5229 | 10.1% |
| n | 5229 | 10.1% |
| P | 826 | 1.6% |
| a | 826 | 1.6% |
| l | 826 | 1.6% |
| Other values (6) | 593 | 1.1% |
earliestPeriodOrLowestSystem
Text
Missing 
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 245750 |
| Missing (%) | 33.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.607453035 |
| Min length | 6 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Triassic |
|---|---|
| 2nd row | Paleogene |
| 3rd row | Neogene |
| 4th row | Permian |
| 5th row | Quaternary |
| Value | Count | Frequency (%) |
| paleogene | 90464 | 18.9% |
| neogene | 72075 | 15.1% |
| cambrian | 48808 | 10.2% |
| recent | 41336 | 8.6% |
| ordovician | 34462 | 7.2% |
| cretaceous | 34238 | 7.2% |
| permian | 32455 | 6.8% |
| quaternary | 27798 | 5.8% |
| devonian | 27637 | 5.8% |
| mississippian | 19734 | 4.1% |
| Other values (14) | 49751 | 10.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 751141 | 18.2% |
| n | 506768 | 12.3% |
| a | 458678 | 11.1% |
| i | 322536 | 7.8% |
| o | 263741 | 6.4% |
| r | 242986 | 5.9% |
| g | 162539 | 3.9% |
| s | 160613 | 3.9% |
| P | 140533 | 3.4% |
| c | 124669 | 3.0% |
| Other values (25) | 986683 | 23.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3642156 | 88.4% |
| Uppercase Letter | 478731 | 11.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 751141 | 20.6% |
| n | 506768 | 13.9% |
| a | 458678 | 12.6% |
| i | 322536 | 8.9% |
| o | 263741 | 7.2% |
| r | 242986 | 6.7% |
| g | 162539 | 4.5% |
| s | 160613 | 4.4% |
| c | 124669 | 3.4% |
| l | 120100 | 3.3% |
| Other values (11) | 528385 | 14.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 140533 | 29.4% |
| C | 84743 | 17.7% |
| N | 72075 | 15.1% |
| R | 41337 | 8.6% |
| O | 34462 | 7.2% |
| Q | 27798 | 5.8% |
| D | 27637 | 5.8% |
| M | 20068 | 4.2% |
| S | 11625 | 2.4% |
| T | 9097 | 1.9% |
| Other values (4) | 9356 | 2.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4120887 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 751141 | 18.2% |
| n | 506768 | 12.3% |
| a | 458678 | 11.1% |
| i | 322536 | 7.8% |
| o | 263741 | 6.4% |
| r | 242986 | 5.9% |
| g | 162539 | 3.9% |
| s | 160613 | 3.9% |
| P | 140533 | 3.4% |
| c | 124669 | 3.0% |
| Other values (25) | 986683 | 23.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4120887 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 751141 | 18.2% |
| n | 506768 | 12.3% |
| a | 458678 | 11.1% |
| i | 322536 | 7.8% |
| o | 263741 | 6.4% |
| r | 242986 | 5.9% |
| g | 162539 | 3.9% |
| s | 160613 | 3.9% |
| P | 140533 | 3.4% |
| c | 124669 | 3.0% |
| Other values (25) | 986683 | 23.9% |
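The category labels in the character tables (Lowercase Letter, Uppercase Letter, Space Separator, and so on) correspond to Unicode general categories. The following is a minimal sketch of how such a per-category table could be tallied with Python's standard `unicodedata` module. The helper names and the label mapping are hypothetical, chosen only to mirror the labels seen in this report, and the three input values are taken from the sample rows above.

```python
import unicodedata
from collections import Counter

# Hypothetical mapping from Unicode general-category codes to report-style labels.
CATEGORY_LABELS = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Nd": "Decimal Number",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:
            continue
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw two-letter code for categories not in the mapping.
            counts[CATEGORY_LABELS.get(code, code)] += 1
    return counts


def frequency_table(counts):
    """Return (label, count, percent) rows, most frequent first."""
    total = sum(counts.values())
    return [(label, n, round(100 * n / total, 1)) for label, n in counts.most_common()]


rows = frequency_table(category_counts(["Paleogene", "Neogene", None, "Permian"]))
for label, n, pct in rows:
    print(f"| {label} | {n} | {pct}% |")
```

Each percentage is the category's share of all characters in the non-missing values, which is the convention the category tables in this report appear to follow.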
latestPeriodOrHighestSystem
Text
Missing 
| Distinct | 15 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 718167 |
| Missing (%) | 99.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 10 |
| Mean length | 8.077905693 |
| Min length | 6 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Devonian |
|---|---|
| 2nd row | Neogene |
| 3rd row | Cretaceous |
| 4th row | Quaternary |
| 5th row | Recent |
| Value | Count | Frequency (%) |
| neogene | 3161 | 49.9% |
| paleogene | 1404 | 22.1% |
| quaternary | 668 | 10.5% |
| devonian | 416 | 6.6% |
| cretaceous | 185 | 2.9% |
| cambrian | 161 | 2.5% |
| ordovician | 137 | 2.2% |
| pennsylvanian | 77 | 1.2% |
| recent | 60 | 0.9% |
| silurian | 30 | 0.5% |
| Other values (5) | 42 | 0.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 15352 | 30.0% |
| n | 6768 | 13.2% |
| o | 5307 | 10.4% |
| g | 4565 | 8.9% |
| a | 4026 | 7.9% |
| N | 3161 | 6.2% |
| r | 1892 | 3.7% |
| l | 1511 | 2.9% |
| P | 1484 | 2.9% |
| i | 1053 | 2.1% |
| Other values (18) | 6103 | 11.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 44881 | 87.6% |
| Uppercase Letter | 6341 | 12.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 15352 | 34.2% |
| n | 6768 | 15.1% |
| o | 5307 | 11.8% |
| g | 4565 | 10.2% |
| a | 4026 | 9.0% |
| r | 1892 | 4.2% |
| l | 1511 | 3.4% |
| i | 1053 | 2.3% |
| t | 914 | 2.0% |
| u | 898 | 2.0% |
| Other values (8) | 2595 | 5.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 3161 | 49.9% |
| P | 1484 | 23.4% |
| Q | 668 | 10.5% |
| D | 416 | 6.6% |
| C | 348 | 5.5% |
| O | 137 | 2.2% |
| R | 60 | 0.9% |
| S | 31 | 0.5% |
| T | 23 | 0.4% |
| J | 13 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51222 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 15352 | 30.0% |
| n | 6768 | 13.2% |
| o | 5307 | 10.4% |
| g | 4565 | 8.9% |
| a | 4026 | 7.9% |
| N | 3161 | 6.2% |
| r | 1892 | 3.7% |
| l | 1511 | 2.9% |
| P | 1484 | 2.9% |
| i | 1053 | 2.1% |
| Other values (18) | 6103 | 11.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 51222 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 15352 | 30.0% |
| n | 6768 | 13.2% |
| o | 5307 | 10.4% |
| g | 4565 | 8.9% |
| a | 4026 | 7.9% |
| N | 3161 | 6.2% |
| r | 1892 | 3.7% |
| l | 1511 | 2.9% |
| P | 1484 | 2.9% |
| i | 1053 | 2.1% |
| Other values (18) | 6103 | 11.9% |
earliestEpochOrLowestSeries
Text
Missing 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 376914 |
| Missing (%) | 52.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 11 |
| Mean length | 6.357434248 |
| Min length | 1 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Middle |
|---|---|
| 2nd row | Eocene |
| 3rd row | Pliocene |
| 4th row | Pleistocene |
| 5th row | Early |
| Value | Count | Frequency (%) |
| middle | 68576 | 19.7% |
| eocene | 66980 | 19.3% |
| late | 57993 | 16.7% |
| miocene | 39410 | 11.3% |
| early | 37474 | 10.8% |
| pliocene | 32039 | 9.2% |
| pleistocene | 20013 | 5.8% |
| oligocene | 15521 | 4.5% |
| paleocene | 7752 | 2.2% |
| holocene | 1481 | 0.4% |
| Other values (10) | 355 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 520801 | 23.6% |
| o | 184703 | 8.4% |
| n | 183525 | 8.3% |
| c | 183200 | 8.3% |
| l | 183151 | 8.3% |
| i | 175926 | 8.0% |
| d | 137364 | 6.2% |
| M | 107985 | 4.9% |
| E | 104453 | 4.7% |
| a | 104017 | 4.7% |
| Other values (22) | 324681 | 14.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1862169 | 84.3% |
| Uppercase Letter | 347612 | 15.7% |
| Other Punctuation | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 520801 | 28.0% |
| o | 184703 | 9.9% |
| n | 183525 | 9.9% |
| c | 183200 | 9.8% |
| l | 183151 | 9.8% |
| i | 175926 | 9.4% |
| d | 137364 | 7.4% |
| a | 104017 | 5.6% |
| t | 78031 | 4.2% |
| r | 37590 | 2.0% |
| Other values (9) | 73861 | 4.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 107985 | 31.1% |
| E | 104453 | 30.0% |
| P | 59809 | 17.2% |
| L | 58036 | 16.7% |
| O | 15517 | 4.5% |
| H | 1481 | 0.4% |
| G | 195 | 0.1% |
| C | 77 | < 0.1% |
| D | 27 | < 0.1% |
| U | 25 | < 0.1% |
| Other values (2) | 7 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 25 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2209781 | > 99.9% |
| Common | 25 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 520801 | 23.6% |
| o | 184703 | 8.4% |
| n | 183525 | 8.3% |
| c | 183200 | 8.3% |
| l | 183151 | 8.3% |
| i | 175926 | 8.0% |
| d | 137364 | 6.2% |
| M | 107985 | 4.9% |
| E | 104453 | 4.7% |
| a | 104017 | 4.7% |
| Other values (21) | 324656 | 14.7% |
Common
| Value | Count | Frequency (%) |
| / | 25 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2209806 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 520801 | 23.6% |
| o | 184703 | 8.4% |
| n | 183525 | 8.3% |
| c | 183200 | 8.3% |
| l | 183151 | 8.3% |
| i | 175926 | 8.0% |
| d | 137364 | 6.2% |
| M | 107985 | 4.9% |
| E | 104453 | 4.7% |
| a | 104017 | 4.7% |
| Other values (22) | 324681 | 14.7% |
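The word table for this variable reads like a count of lowercased tokens. A minimal sketch follows, assuming whitespace tokenization with surrounding ASCII punctuation stripped; the profiler's actual tokenizer may differ, and `word_frequencies` is a hypothetical helper name. The input reuses the five sample rows above, with one repeated value and one missing value added for illustration.

```python
import string
from collections import Counter


def word_frequencies(values):
    """Count lowercased, punctuation-trimmed tokens across non-missing values."""
    counts = Counter()
    for value in values:
        if value is None:
            continue
        for token in value.lower().split():
            # Assumption: strip punctuation only at the token edges, so "jett's" stays whole.
            token = token.strip(string.punctuation)
            if token:
                counts[token] += 1
    return counts


sample = ["Middle", "Eocene", "Pliocene", "Pleistocene", "Early", "Eocene", None]
freq = word_frequencies(sample)
total = sum(freq.values())
for word, n in freq.most_common():
    print(f"| {word} | {n} | {100 * n / total:.1f}% |")
```

Because every value in this column is a single word, the token total equals the number of non-missing values, which matches how the counts in the word table above sum to the non-missing row count.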
latestEpochOrHighestSeries
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 718290 |
| Missing (%) | 99.1% |
| Memory size | 5.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 9 |
| Mean length | 7.33708588 |
| Min length | 4 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Middle |
|---|---|
| 2nd row | Pliocene |
| 3rd row | Late |
| 4th row | Pleistocene |
| 5th row | Miocene |
| Value | Count | Frequency (%) |
| pliocene | 2384 | 38.3% |
| eocene | 1075 | 17.3% |
| miocene | 759 | 12.2% |
| late | 645 | 10.4% |
| pleistocene | 645 | 10.4% |
| middle | 364 | 5.9% |
| oligocene | 188 | 3.0% |
| paleocene | 97 | 1.6% |
| early | 34 | 0.5% |
| holocene | 14 | 0.2% |
| Other values (2) | 13 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 12099 | 26.5% |
| o | 5177 | 11.3% |
| n | 5176 | 11.3% |
| c | 5174 | 11.3% |
| i | 4342 | 9.5% |
| l | 3726 | 8.2% |
| P | 3126 | 6.9% |
| t | 1302 | 2.9% |
| M | 1123 | 2.5% |
| E | 1109 | 2.4% |
| Other values (11) | 3268 | 7.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 39404 | 86.4% |
| Uppercase Letter | 6218 | 13.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 12099 | 30.7% |
| o | 5177 | 13.1% |
| n | 5176 | 13.1% |
| c | 5174 | 13.1% |
| i | 4342 | 11.0% |
| l | 3726 | 9.5% |
| t | 1302 | 3.3% |
| a | 777 | 2.0% |
| d | 728 | 1.8% |
| s | 645 | 1.6% |
| Other values (4) | 258 | 0.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3126 | 50.3% |
| M | 1123 | 18.1% |
| E | 1109 | 17.8% |
| L | 646 | 10.4% |
| O | 188 | 3.0% |
| H | 14 | 0.2% |
| R | 12 | 0.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 45622 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 12099 | 26.5% |
| o | 5177 | 11.3% |
| n | 5176 | 11.3% |
| c | 5174 | 11.3% |
| i | 4342 | 9.5% |
| l | 3726 | 8.2% |
| P | 3126 | 6.9% |
| t | 1302 | 2.9% |
| M | 1123 | 2.5% |
| E | 1109 | 2.4% |
| Other values (11) | 3268 | 7.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 45622 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 12099 | 26.5% |
| o | 5177 | 11.3% |
| n | 5176 | 11.3% |
| c | 5174 | 11.3% |
| i | 4342 | 9.5% |
| l | 3726 | 8.2% |
| P | 3126 | 6.9% |
| t | 1302 | 2.9% |
| M | 1123 | 2.5% |
| E | 1109 | 2.4% |
| Other values (11) | 3268 | 7.2% |
earliestAgeOrLowestStage
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 562472 |
| Missing (%) | 77.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 19 |
| Mean length | 9.036053716 |
| Min length | 4 |
Unique
| Unique | 38 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Anisian |
|---|---|
| 2nd row | Hemphillian |
| 3rd row | Middle |
| 4th row | Emsian |
| 5th row | Irvingtonian |
| Value | Count | Frequency (%) |
| hemphillian | 19681 | 12.1% |
| middle | 17380 | 10.7% |
| wasatchian | 7037 | 4.3% |
| early | 5466 | 3.4% |
| orellan | 5085 | 3.1% |
| bridgerian | 4799 | 2.9% |
| maastrichtian | 4686 | 2.9% |
| campanian | 4051 | 2.5% |
| chadronian | 3871 | 2.4% |
| ypresian | 3476 | 2.1% |
| Other values (350) | 87399 | 53.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 228885 | 15.6% |
| n | 195907 | 13.4% |
| i | 190767 | 13.0% |
| e | 105142 | 7.2% |
| l | 96307 | 6.6% |
| r | 75689 | 5.2% |
| d | 61340 | 4.2% |
| o | 52724 | 3.6% |
| h | 47497 | 3.2% |
| s | 40454 | 2.8% |
| Other values (44) | 369454 | 25.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1300773 | 88.8% |
| Uppercase Letter | 162483 | 11.1% |
| Space Separator | 895 | 0.1% |
| Other Punctuation | 13 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 228885 | 17.6% |
| n | 195907 | 15.1% |
| i | 190767 | 14.7% |
| e | 105142 | 8.1% |
| l | 96307 | 7.4% |
| r | 75689 | 5.8% |
| d | 61340 | 4.7% |
| o | 52724 | 4.1% |
| h | 47497 | 3.7% |
| s | 40454 | 3.1% |
| Other values (16) | 206061 | 15.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 28152 | 17.3% |
| C | 21480 | 13.2% |
| H | 20672 | 12.7% |
| W | 12315 | 7.6% |
| B | 10522 | 6.5% |
| O | 10358 | 6.4% |
| T | 8937 | 5.5% |
| E | 7395 | 4.6% |
| A | 6493 | 4.0% |
| L | 6455 | 4.0% |
| Other values (14) | 29704 | 18.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 12 | 92.3% |
| , | 1 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 895 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 2 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1463256 | 99.9% |
| Common | 910 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 228885 | 15.6% |
| n | 195907 | 13.4% |
| i | 190767 | 13.0% |
| e | 105142 | 7.2% |
| l | 96307 | 6.6% |
| r | 75689 | 5.2% |
| d | 61340 | 4.2% |
| o | 52724 | 3.6% |
| h | 47497 | 3.2% |
| s | 40454 | 2.8% |
| Other values (40) | 368544 | 25.2% |
Common
| Value | Count | Frequency (%) |
| (space) | 895 | 98.4% |
| / | 12 | 1.3% |
| 4 | 2 | 0.2% |
| , | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1464166 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 228885 | 15.6% |
| n | 195907 | 13.4% |
| i | 190767 | 13.0% |
| e | 105142 | 7.2% |
| l | 96307 | 6.6% |
| r | 75689 | 5.2% |
| d | 61340 | 4.2% |
| o | 52724 | 3.6% |
| h | 47497 | 3.2% |
| s | 40454 | 2.8% |
| Other values (44) | 369454 | 25.2% |
latestAgeOrHighestStage
Text
Missing 
| Distinct | 35 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 722133 |
| Missing (%) | 99.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 8 |
| Mean length | 8.232 |
| Min length | 4 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Givetian |
|---|---|
| 2nd row | Turonian |
| 3rd row | Gelasian |
| 4th row | Gelasian |
| 5th row | Gelasian |
| Value | Count | Frequency (%) |
| lutetian | 829 | 34.9% |
| zanclean | 319 | 13.4% |
| tortonian | 217 | 9.1% |
| gelasian | 200 | 8.4% |
| maastrichtian | 105 | 4.4% |
| late | 98 | 4.1% |
| messinian | 78 | 3.3% |
| thanetian | 78 | 3.3% |
| ypresian | 60 | 2.5% |
| langhian | 58 | 2.4% |
| Other values (25) | 333 | 14.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 3358 | 17.2% |
| n | 3107 | 15.9% |
| t | 2287 | 11.7% |
| i | 2268 | 11.6% |
| e | 1838 | 9.4% |
| L | 1015 | 5.2% |
| u | 862 | 4.4% |
| l | 662 | 3.4% |
| o | 553 | 2.8% |
| s | 534 | 2.7% |
| Other values (28) | 3067 | 15.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17176 | 87.9% |
| Uppercase Letter | 2375 | 12.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3358 | 19.6% |
| n | 3107 | 18.1% |
| t | 2287 | 13.3% |
| i | 2268 | 13.2% |
| e | 1838 | 10.7% |
| u | 862 | 5.0% |
| l | 662 | 3.9% |
| o | 553 | 3.2% |
| s | 534 | 3.1% |
| r | 515 | 3.0% |
| Other values (13) | 1192 | 6.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 1015 | 42.7% |
| Z | 319 | 13.4% |
| T | 297 | 12.5% |
| G | 223 | 9.4% |
| M | 196 | 8.3% |
| E | 90 | 3.8% |
| Y | 60 | 2.5% |
| P | 53 | 2.2% |
| C | 50 | 2.1% |
| B | 32 | 1.3% |
| Other values (5) | 40 | 1.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 19551 | 100.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3358 | 17.2% |
| n | 3107 | 15.9% |
| t | 2287 | 11.7% |
| i | 2268 | 11.6% |
| e | 1838 | 9.4% |
| L | 1015 | 5.2% |
| u | 862 | 4.4% |
| l | 662 | 3.4% |
| o | 553 | 2.8% |
| s | 534 | 2.7% |
| Other values (28) | 3067 | 15.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19551 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 3358 | 17.2% |
| n | 3107 | 15.9% |
| t | 2287 | 11.7% |
| i | 2268 | 11.6% |
| e | 1838 | 9.4% |
| L | 1015 | 5.2% |
| u | 862 | 4.4% |
| l | 662 | 3.4% |
| o | 553 | 2.8% |
| s | 534 | 2.7% |
| Other values (28) | 3067 | 15.7% |
group
Text
Missing 
| Distinct | 557 |
|---|---|
| Distinct (%) | 0.6% |
| Missing | 633218 |
| Missing (%) | 87.4% |
| Memory size | 5.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 28 |
| Mean length | 14.80891664 |
| Min length | 1 |
Unique
| Unique | 146 |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Star Peak Group |
|---|---|
| 2nd row | Chesapeake Group |
| 3rd row | Keokuk Group |
| 4th row | Chesapeake Group |
| 5th row | Chesapeake Group |
| Value | Count | Frequency (%) |
| group | 90331 | 46.7% |
| chesapeake | 38410 | 19.9% |
| river | 7802 | 4.0% |
| white | 5751 | 3.0% |
| selma | 3439 | 1.8% |
| kewanee | 2702 | 1.4% |
| hamilton | 2337 | 1.2% |
| osage | 2256 | 1.2% |
| washita | 1421 | 0.7% |
| pamunkey | 1419 | 0.7% |
| Other values (577) | 37508 | 19.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 166874 | 12.3% |
| p | 131366 | 9.7% |
| a | 118438 | 8.8% |
| r | 115845 | 8.6% |
| o | 113583 | 8.4% |
| (space) | 102086 | 7.6% |
| u | 98547 | 7.3% |
| G | 90741 | 6.7% |
| s | 54633 | 4.0% |
| h | 50628 | 3.7% |
| Other values (47) | 309165 | 22.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1056168 | 78.1% |
| Uppercase Letter | 193474 | 14.3% |
| Space Separator | 102086 | 7.6% |
| Other Punctuation | 124 | < 0.1% |
| Open Punctuation | 21 | < 0.1% |
| Close Punctuation | 21 | < 0.1% |
| Dash Punctuation | 12 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 166874 | 15.8% |
| p | 131366 | 12.4% |
| a | 118438 | 11.2% |
| r | 115845 | 11.0% |
| o | 113583 | 10.8% |
| u | 98547 | 9.3% |
| s | 54633 | 5.2% |
| h | 50628 | 4.8% |
| k | 45139 | 4.3% |
| i | 34291 | 3.2% |
| Other values (16) | 126824 | 12.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 90741 | 46.9% |
| C | 43143 | 22.3% |
| R | 9045 | 4.7% |
| W | 8105 | 4.2% |
| S | 6248 | 3.2% |
| M | 4589 | 2.4% |
| P | 4340 | 2.2% |
| K | 3671 | 1.9% |
| O | 3592 | 1.9% |
| H | 3351 | 1.7% |
| Other values (15) | 16649 | 8.6% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 88 | 71.0% |
| , | 36 | 29.0% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 102086 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 21 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 21 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 12 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1249642 | 92.4% |
| Common | 102264 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 166874 | 13.4% |
| p | 131366 | 10.5% |
| a | 118438 | 9.5% |
| r | 115845 | 9.3% |
| o | 113583 | 9.1% |
| u | 98547 | 7.9% |
| G | 90741 | 7.3% |
| s | 54633 | 4.4% |
| h | 50628 | 4.1% |
| k | 45139 | 3.6% |
| Other values (41) | 263848 | 21.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 102086 | 99.8% |
| . | 88 | 0.1% |
| , | 36 | < 0.1% |
| ( | 21 | < 0.1% |
| ) | 21 | < 0.1% |
| - | 12 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1351906 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 166874 | 12.3% |
| p | 131366 | 9.7% |
| a | 118438 | 8.8% |
| r | 115845 | 8.6% |
| o | 113583 | 8.4% |
| (space) | 102086 | 7.6% |
| u | 98547 | 7.3% |
| G | 90741 | 6.7% |
| s | 54633 | 4.0% |
| h | 50628 | 3.7% |
| Other values (47) | 309165 | 22.9% |
formation
Text
Missing 
| Distinct | 5419 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 365706 |
| Missing (%) | 50.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 46 |
|---|---|
| Median length | 38 |
| Mean length | 11.49027319 |
| Min length | 3 |
Unique
| Unique | 1482 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Prida Fm |
|---|---|
| 2nd row | Yorktown Fm |
| 3rd row | Skinner Ranch Fm |
| 4th row | San Pedro Fm |
| 5th row | Grande Greve Fm |
| Value | Count | Frequency (%) |
| fm | 259134 | 32.0% |
| river | 44301 | 5.5% |
| ls | 39737 | 4.9% |
| stephen | 31376 | 3.9% |
| green | 29207 | 3.6% |
| yorktown | 23754 | 2.9% |
| unknown | 18762 | 2.3% |
| sh | 17735 | 2.2% |
| pungo | 10262 | 1.3% |
| canyon | 8111 | 1.0% |
| Other values (4425) | 326422 | 40.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 449999 | 10.9% |
| e | 361227 | 8.8% |
| n | 317355 | 7.7% |
| m | 288475 | 7.0% |
| F | 271104 | 6.6% |
| r | 245377 | 6.0% |
| o | 238913 | 5.8% |
| a | 212844 | 5.2% |
| i | 166070 | 4.0% |
| t | 160119 | 3.9% |
| Other values (56) | 1411250 | 34.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2858690 | 69.3% |
| Uppercase Letter | 809683 | 19.6% |
| Space Separator | 449999 | 10.9% |
| Other Punctuation | 3867 | 0.1% |
| Decimal Number | 156 | < 0.1% |
| Open Punctuation | 135 | < 0.1% |
| Close Punctuation | 134 | < 0.1% |
| Dash Punctuation | 69 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 361227 | 12.6% |
| n | 317355 | 11.1% |
| m | 288475 | 10.1% |
| r | 245377 | 8.6% |
| o | 238913 | 8.4% |
| a | 212844 | 7.4% |
| i | 166070 | 5.8% |
| t | 160119 | 5.6% |
| l | 128749 | 4.5% |
| s | 112733 | 3.9% |
| Other values (16) | 626828 | 21.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 271104 | 33.5% |
| S | 78359 | 9.7% |
| R | 63222 | 7.8% |
| L | 61354 | 7.6% |
| C | 52642 | 6.5% |
| G | 37852 | 4.7% |
| B | 36649 | 4.5% |
| M | 26756 | 3.3% |
| P | 26718 | 3.3% |
| Y | 24537 | 3.0% |
| Other values (15) | 130490 | 16.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2426 | 62.7% |
| , | 703 | 18.2% |
| ? | 651 | 16.8% |
| ' | 64 | 1.7% |
| / | 19 | 0.5% |
| " | 4 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 147 | 94.2% |
| 3 | 3 | 1.9% |
| 9 | 2 | 1.3% |
| 2 | 2 | 1.3% |
| 0 | 2 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 449999 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 135 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 134 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 69 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3668373 | 89.0% |
| Common | 454360 | 11.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 361227 | 9.8% |
| n | 317355 | 8.7% |
| m | 288475 | 7.9% |
| F | 271104 | 7.4% |
| r | 245377 | 6.7% |
| o | 238913 | 6.5% |
| a | 212844 | 5.8% |
| i | 166070 | 4.5% |
| t | 160119 | 4.4% |
| l | 128749 | 3.5% |
| Other values (41) | 1278140 | 34.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 449999 | 99.0% |
| . | 2426 | 0.5% |
| , | 703 | 0.2% |
| ? | 651 | 0.1% |
| 1 | 147 | < 0.1% |
| ( | 135 | < 0.1% |
| ) | 134 | < 0.1% |
| - | 69 | < 0.1% |
| ' | 64 | < 0.1% |
| / | 19 | < 0.1% |
| Other values (5) | 13 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4122733 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 449999 | 10.9% |
| e | 361227 | 8.8% |
| n | 317355 | 7.7% |
| m | 288475 | 7.0% |
| F | 271104 | 6.6% |
| r | 245377 | 6.0% |
| o | 238913 | 5.8% |
| a | 212844 | 5.2% |
| i | 166070 | 4.0% |
| t | 160119 | 3.9% |
| Other values (56) | 1411250 | 34.2% |
member
Text
Missing 
| Distinct | 1626 |
|---|---|
| Distinct (%) | 2.0% |
| Missing | 643191 |
| Missing (%) | 88.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 30 |
| Mean length | 13.99831524 |
| Min length | 1 |
Unique
| Unique | 471 |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | Fossil Hill Mbr |
|---|---|
| 2nd row | Decie Ranch Mbr |
| 3rd row | Millersburg Mbr |
| 4th row | Thin-Bedded Zone Of Udden |
| 5th row | Burgess Sh Mbr |
| Value | Count | Frequency (%) |
| mbr | 79698 | 34.1% |
| sh | 36967 | 15.8% |
| burgess | 30811 | 13.2% |
| ls | 6535 | 2.8% |
| creek | 4230 | 1.8% |
| sunken | 3525 | 1.5% |
| meadow | 3525 | 1.5% |
| ranch | 3361 | 1.4% |
| francis | 2603 | 1.1% |
| b | 2492 | 1.1% |
| Other values (1500) | 60135 | 25.7% |
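The word counts above look like whitespace-separated tokens compared case-insensitively (the sample value "Burgess Sh Mbr" is tallied under `burgess`, `sh` and `mbr`). A minimal Python sketch of that kind of tally follows. It assumes lowercasing, whitespace splitting and stripping of surrounding punctuation; the helper name `word_counts` is illustrative, and this is not necessarily how the profiling tool tokenizes.

```python
import string
from collections import Counter


def word_counts(values):
    """Tally lowercased, whitespace-separated tokens, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:  # missing cells contribute no tokens
            continue
        for token in value.lower().split():
            # Assumption: punctuation is trimmed from both ends of a token only.
            token = token.strip(string.punctuation)
            if token:
                counts[token] += 1
    return counts


# The five sample rows shown for this variable, plus one missing cell.
sample = [
    "Fossil Hill Mbr",
    "Decie Ranch Mbr",
    "Millersburg Mbr",
    "Thin-Bedded Zone Of Udden",
    "Burgess Sh Mbr",
    None,
]
counts = word_counts(sample)
total = sum(counts.values())
for word, n in counts.most_common(3):
    print(f"{word}\t{n}\t{100 * n / total:.1f}%")
```

Dividing each count by the total number of tokens, as in the loop above, gives the Frequency (%) column.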
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 152565 | 13.4% |
| r | 138201 | 12.1% |
| M | 87327 | 7.7% |
| s | 86157 | 7.6% |
| b | 84523 | 7.4% |
| e | 79157 | 7.0% |
| h | 47967 | 4.2% |
| S | 46866 | 4.1% |
| u | 42615 | 3.7% |
| a | 41195 | 3.6% |
| Other values (60) | 331728 | 29.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 749978 | 65.9% |
| Uppercase Letter | 232978 | 20.5% |
| Space Separator | 152565 | 13.4% |
| Decimal Number | 2131 | 0.2% |
| Other Punctuation | 324 | < 0.1% |
| Dash Punctuation | 290 | < 0.1% |
| Open Punctuation | 17 | < 0.1% |
| Close Punctuation | 17 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
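The category labels in this table match the long names of Unicode general categories. As a rough illustration of how such a breakdown can be reproduced, the sketch below uses Python's standard `unicodedata.category`. The `CATEGORY_NAMES` mapping and the function names are assumptions made for this example and are not taken from the profiling tool.

```python
import unicodedata
from collections import Counter

# Long names for the two-letter general-category codes that occur in this report.
# Illustrative mapping only; codes not listed here are reported as-is.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category, skipping missing values."""
    counts = Counter()
    for value in values:
        if value is None:
            continue
        for ch in value:
            code = unicodedata.category(ch)
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


def as_percentages(counts):
    """Express each count as a percentage of all characters, to one decimal."""
    total = sum(counts.values())
    return {name: round(100 * n / total, 1) for name, n in counts.items()}


sample = ["Fossil Hill Mbr", "Burgess Sh Mbr", None, "Thin-Bedded Zone Of Udden"]
counts = category_counts(sample)
percentages = as_percentages(counts)
for name, n in counts.most_common():
    print(f"{name}\t{n}\t{percentages[name]}%")
```

The same division by the total character count applies to the per-script and per-block tables, although Python's standard library does not expose script or block names.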
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 138201 | 18.4% |
| s | 86157 | 11.5% |
| b | 84523 | 11.3% |
| e | 79157 | 10.6% |
| h | 47967 | 6.4% |
| u | 42615 | 5.7% |
| a | 41195 | 5.5% |
| g | 38517 | 5.1% |
| n | 36464 | 4.9% |
| i | 27554 | 3.7% |
| Other values (16) | 127628 | 17.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 87327 | 37.5% |
| S | 46866 | 20.1% |
| B | 39596 | 17.0% |
| C | 10761 | 4.6% |
| L | 9429 | 4.0% |
| R | 5451 | 2.3% |
| F | 4926 | 2.1% |
| P | 4323 | 1.9% |
| G | 4164 | 1.8% |
| W | 4116 | 1.8% |
| Other values (15) | 16019 | 6.9% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 858 | 40.3% |
| 2 | 337 | 15.8% |
| 3 | 289 | 13.6% |
| 4 | 247 | 11.6% |
| 5 | 130 | 6.1% |
| 0 | 124 | 5.8% |
| 6 | 102 | 4.8% |
| 7 | 24 | 1.1% |
| 9 | 16 | 0.8% |
| 8 | 4 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 131 | 40.4% |
| . | 128 | 39.5% |
| ? | 64 | 19.8% |
| ' | 1 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 152565 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 290 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 17 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 17 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 982956 | 86.4% |
| Common | 155345 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 138201 | 14.1% |
| M | 87327 | 8.9% |
| s | 86157 | 8.8% |
| b | 84523 | 8.6% |
| e | 79157 | 8.1% |
| h | 47967 | 4.9% |
| S | 46866 | 4.8% |
| u | 42615 | 4.3% |
| a | 41195 | 4.2% |
| B | 39596 | 4.0% |
| Other values (41) | 289352 | 29.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 152565 | 98.2% |
| 1 | 858 | 0.6% |
| 2 | 337 | 0.2% |
| - | 290 | 0.2% |
| 3 | 289 | 0.2% |
| 4 | 247 | 0.2% |
| , | 131 | 0.1% |
| 5 | 130 | 0.1% |
| . | 128 | 0.1% |
| 0 | 124 | 0.1% |
| Other values (9) | 246 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1138301 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 152565 | 13.4% |
| r | 138201 | 12.1% |
| M | 87327 | 7.7% |
| s | 86157 | 7.6% |
| b | 84523 | 7.4% |
| e | 79157 | 7.0% |
| h | 47967 | 4.2% |
| S | 46866 | 4.1% |
| u | 42615 | 3.7% |
| a | 41195 | 3.6% |
| Other values (60) | 331728 | 29.1% |
typeStatus
Text
Missing 
| Distinct | 57 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 581882 |
| Missing (%) | 80.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 8 |
| Mean length | 7.816414959 |
| Min length | 4 |
Unique
| Unique | 18 |
|---|---|
| Unique (%) | < 0.1% |
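The figures in these summary tables are consistent with Distinct (%) and Unique (%) being taken over non-missing values and Missing (%) over all 724508 rows (for typeStatus, 581882 / 724508 ≈ 80.3%). The sketch below restates those definitions in plain Python under that assumption. The function name `summarize` and the toy column are illustrative, not part of the report.

```python
from collections import Counter


def summarize(values):
    """Distinct, missing and unique statistics for one text column."""
    n_rows = len(values)
    present = [v for v in values if v is not None]
    n_present = len(present)
    counts = Counter(present)
    n_distinct = len(counts)
    # "Unique" here means values that occur exactly once in the column.
    n_unique = sum(1 for c in counts.values() if c == 1)
    return {
        "distinct": n_distinct,
        "distinct_pct": 100 * n_distinct / n_present if n_present else 0.0,
        "missing": n_rows - n_present,
        "missing_pct": 100 * (n_rows - n_present) / n_rows if n_rows else 0.0,
        "unique": n_unique,
        "unique_pct": 100 * n_unique / n_present if n_present else 0.0,
    }


# Toy column: the five typeStatus sample rows plus three missing cells.
sample = ["Paratype", "Paratype", "Paratype", "Type", "Holotype", None, None, None]
stats = summarize(sample)
for key, value in stats.items():
    print(f"{key}: {value}")
```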
Sample
| 1st row | Paratype |
|---|---|
| 2nd row | Paratype |
| 3rd row | Paratype |
| 4th row | Type |
| 5th row | Holotype |
| Value | Count | Frequency (%) |
| paratype | 74620 | 52.2% |
| holotype | 34727 | 24.3% |
| syntype | 19596 | 13.7% |
| type | 7957 | 5.6% |
| paralectotype | 2999 | 2.1% |
| lectotype | 1087 | 0.8% |
| plastoholotype | 595 | 0.4% |
| plastotype | 390 | 0.3% |
| plastoparatype | 282 | 0.2% |
| plastosyntype | 253 | 0.2% |
| Other values (12) | 325 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| y | 162651 | 14.6% |
| a | 157416 | 14.1% |
| e | 147041 | 13.2% |
| p | 143090 | 12.8% |
| t | 140517 | 12.6% |
| P | 79203 | 7.1% |
| r | 77963 | 7.0% |
| o | 76542 | 6.9% |
| l | 39911 | 3.6% |
| H | 34727 | 3.1% |
| Other values (15) | 55763 | 5.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 971641 | 87.2% |
| Uppercase Letter | 142831 | 12.8% |
| Space Separator | 205 | < 0.1% |
| Other Punctuation | 147 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| y | 162651 | 16.7% |
| a | 157416 | 16.2% |
| e | 147041 | 15.1% |
| p | 143090 | 14.7% |
| t | 140517 | 14.5% |
| r | 77963 | 8.0% |
| o | 76542 | 7.9% |
| l | 39911 | 4.1% |
| n | 19880 | 2.0% |
| c | 4119 | 0.4% |
| Other values (3) | 2511 | 0.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 79203 | 55.5% |
| H | 34727 | 24.3% |
| S | 19625 | 13.7% |
| T | 7957 | 5.6% |
| L | 1087 | 0.8% |
| N | 143 | 0.1% |
| O | 29 | < 0.1% |
| I | 28 | < 0.1% |
| M | 19 | < 0.1% |
| C | 13 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 205 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 147 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1114472 | > 99.9% |
| Common | 352 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| y | 162651 | 14.6% |
| a | 157416 | 14.1% |
| e | 147041 | 13.2% |
| p | 143090 | 12.8% |
| t | 140517 | 12.6% |
| P | 79203 | 7.1% |
| r | 77963 | 7.0% |
| o | 76542 | 6.9% |
| l | 39911 | 3.6% |
| H | 34727 | 3.1% |
| Other values (13) | 55411 | 5.0% |
Common
| Value | Count | Frequency (%) |
| (space) | 205 | 58.2% |
| ; | 147 | 41.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1114824 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| y | 162651 | 14.6% |
| a | 157416 | 14.1% |
| e | 147041 | 13.2% |
| p | 143090 | 12.8% |
| t | 140517 | 12.6% |
| P | 79203 | 7.1% |
| r | 77963 | 7.0% |
| o | 76542 | 6.9% |
| l | 39911 | 3.6% |
| H | 34727 | 3.1% |
| Other values (15) | 55763 | 5.0% |
identifiedBy
Text
Missing 
| Distinct | 2463 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 521981 |
| Missing (%) | 72.0% |
| Memory size | 5.5 MiB |
Length
| Max length | 147 |
|---|---|
| Median length | 124 |
| Mean length | 22.47668212 |
| Min length | 2 |
Unique
| Unique | 535 |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Silberling; Nichols |
|---|---|
| 2nd row | Vaughan |
| 3rd row | Harper; Boucot |
| 4th row | Said; Barakat, M. G. |
| 5th row | Smith |
| Value | Count | Frequency (%) |
| united | 21468 | 3.2% |
| states | 21082 | 3.2% |
| of | 20281 | 3.1% |
| museum | 15734 | 2.4% |
| helen | 15316 | 2.3% |
| | 12006 | 1.8% |
| natural | 11887 | 1.8% |
| history | 11620 | 1.8% |
| institution | 11572 | 1.7% |
| smithsonian | 11571 | 1.7% |
| Other values (2466) | 510240 | 77.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| (space) | 460250 | 10.1% |
| e | 280098 | 6.2% |
| o | 272102 | 6.0% |
| a | 259642 | 5.7% |
| n | 241275 | 5.3% |
| t | 230888 | 5.1% |
| r | 226036 | 5.0% |
| i | 214007 | 4.7% |
| l | 181066 | 4.0% |
| s | 174306 | 3.8% |
| Other values (58) | 2012465 | 44.2% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2806351 | 61.6% |
| Uppercase Letter | 908175 | 20.0% |
| Space Separator | 460250 | 10.1% |
| Other Punctuation | 280258 | 6.2% |
| Close Punctuation | 40168 | 0.9% |
| Open Punctuation | 40168 | 0.9% |
| Dash Punctuation | 16765 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 280098 | 10.0% |
| o | 272102 | 9.7% |
| a | 259642 | 9.3% |
| n | 241275 | 8.6% |
| t | 230888 | 8.2% |
| r | 226036 | 8.1% |
| i | 214007 | 7.6% |
| l | 181066 | 6.5% |
| s | 174306 | 6.2% |
| u | 121224 | 4.3% |
| Other values (22) | 605707 | 21.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 117932 | 13.0% |
| T | 78022 | 8.6% |
| A | 60143 | 6.6% |
| N | 59104 | 6.5% |
| C | 57622 | 6.3% |
| E | 56100 | 6.2% |
| I | 46266 | 5.1% |
| D | 44046 | 4.8% |
| H | 42705 | 4.7% |
| U | 40270 | 4.4% |
| Other values (16) | 305965 | 33.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 138675 | 49.5% |
| . | 77116 | 27.5% |
| ; | 64257 | 22.9% |
| / | 177 | 0.1% |
| ' | 23 | < 0.1% |
| & | 10 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 460250 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 40168 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 40168 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 16765 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3714526 | 81.6% |
| Common | 837609 | 18.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 280098 | 7.5% |
| o | 272102 | 7.3% |
| a | 259642 | 7.0% |
| n | 241275 | 6.5% |
| t | 230888 | 6.2% |
| r | 226036 | 6.1% |
| i | 214007 | 5.8% |
| l | 181066 | 4.9% |
| s | 174306 | 4.7% |
| u | 121224 | 3.3% |
| Other values (48) | 1513882 | 40.8% |
Common
| Value | Count | Frequency (%) |
| (space) | 460250 | 54.9% |
| , | 138675 | 16.6% |
| . | 77116 | 9.2% |
| ; | 64257 | 7.7% |
| ) | 40168 | 4.8% |
| ( | 40168 | 4.8% |
| - | 16765 | 2.0% |
| / | 177 | < 0.1% |
| ' | 23 | < 0.1% |
| & | 10 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4550350 | > 99.9% |
| None | 1785 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| (space) | 460250 | 10.1% |
| e | 280098 | 6.2% |
| o | 272102 | 6.0% |
| a | 259642 | 5.7% |
| n | 241275 | 5.3% |
| t | 230888 | 5.1% |
| r | 226036 | 5.0% |
| i | 214007 | 4.7% |
| l | 181066 | 4.0% |
| s | 174306 | 3.8% |
| Other values (52) | 2010680 | 44.2% |
None
| Value | Count | Frequency (%) |
| ñ | 1143 | 64.0% |
| ý | 251 | 14.1% |
| š | 251 | 14.1% |
| ö | 138 | 7.7% |
| ú | 1 | 0.1% |
| í | 1 | 0.1% |
scientificName
Text
Missing 
| Distinct | 97401 |
|---|---|
| Distinct (%) | 17.6% |
| Missing | 171332 |
| Missing (%) | 23.6% |
| Memory size | 5.5 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 56 |
| Mean length | 18.07695742 |
| Min length | 5 |
Unique
| Unique | 44766 |
|---|---|
| Unique (%) | 8.1% |
Sample
| 1st row | Damaliscus lunatus |
|---|---|
| 2nd row | Acrochordiceras hyatti |
| 3rd row | Discocyclina (Asterocyclina) sculpturata |
| 4th row | Odontaspis cuspidata |
| 5th row | Enteletes rotundobesus |
| Value | Count | Frequency (%) |
| sp | 136960 | 12.1% |
| genus | 56232 | 5.0% |
| insecta | 16851 | 1.5% |
| splendens | 12400 | 1.1% |
| marrella | 12281 | 1.1% |
| pterodroma | 7305 | 0.6% |
| var | 6498 | 0.6% |
| callophoca | 3770 | 0.3% |
| isurus | 3463 | 0.3% |
| ostracoda | 3391 | 0.3% |
| Other values (53913) | 873954 | 77.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1021294 | 10.2% |
| s | 909134 | 9.1% |
| i | 819278 | 8.2% |
| e | 762530 | 7.6% |
| o | 610330 | 6.1% |
| r | 609311 | 6.1% |
| n | 592254 | 5.9% |
| (space) | 579929 | 5.8% |
| l | 537519 | 5.4% |
| u | 466436 | 4.7% |
| Other values (62) | 3091724 | 30.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8787040 | 87.9% |
| Space Separator | 579929 | 5.8% |
| Uppercase Letter | 575487 | 5.8% |
| Close Punctuation | 22326 | 0.2% |
| Open Punctuation | 22314 | 0.2% |
| Other Punctuation | 10186 | 0.1% |
| Decimal Number | 1938 | < 0.1% |
| Dash Punctuation | 518 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1021294 | 11.6% |
| s | 909134 | 10.3% |
| i | 819278 | 9.3% |
| e | 762530 | 8.7% |
| o | 610330 | 6.9% |
| r | 609311 | 6.9% |
| n | 592254 | 6.7% |
| l | 537519 | 6.1% |
| u | 466436 | 5.3% |
| t | 465047 | 5.3% |
| Other values (16) | 1993907 | 22.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 79813 | 13.9% |
| P | 69195 | 12.0% |
| C | 60147 | 10.5% |
| A | 39927 | 6.9% |
| M | 39806 | 6.9% |
| S | 35677 | 6.2% |
| B | 27831 | 4.8% |
| H | 26616 | 4.6% |
| T | 26590 | 4.6% |
| I | 25413 | 4.4% |
| Other values (16) | 144472 | 25.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 962 | 49.6% |
| 2 | 543 | 28.0% |
| 3 | 206 | 10.6% |
| 4 | 92 | 4.7% |
| 5 | 67 | 3.5% |
| 6 | 38 | 2.0% |
| 7 | 19 | 1.0% |
| 8 | 5 | 0.3% |
| 0 | 4 | 0.2% |
| 9 | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 10146 | 99.6% |
| ' | 21 | 0.2% |
| ? | 13 | 0.1% |
| * | 5 | < 0.1% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 579929 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 22326 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 22314 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 518 | 100.0% |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9362527 | 93.6% |
| Common | 637212 | 6.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1021294 | 10.9% |
| s | 909134 | 9.7% |
| i | 819278 | 8.8% |
| e | 762530 | 8.1% |
| o | 610330 | 6.5% |
| r | 609311 | 6.5% |
| n | 592254 | 6.3% |
| l | 537519 | 5.7% |
| u | 466436 | 5.0% |
| t | 465047 | 5.0% |
| Other values (42) | 2569394 | 27.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 579929 | 91.0% |
| ) | 22326 | 3.5% |
| ( | 22314 | 3.5% |
| . | 10146 | 1.6% |
| 1 | 962 | 0.2% |
| 2 | 543 | 0.1% |
| - | 518 | 0.1% |
| 3 | 206 | < 0.1% |
| 4 | 92 | < 0.1% |
| 5 | 67 | < 0.1% |
| Other values (10) | 109 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9999739 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1021294 | 10.2% |
| s | 909134 | 9.1% |
| i | 819278 | 8.2% |
| e | 762530 | 7.6% |
| o | 610330 | 6.1% |
| r | 609311 | 6.1% |
| n | 592254 | 5.9% |
| (space) | 579929 | 5.8% |
| l | 537519 | 5.4% |
| u | 466436 | 4.7% |
| Other values (62) | 3091724 | 30.9% |
Missing 
| Distinct | 3844 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 172643 |
| Missing (%) | 23.8% |
| Memory size | 5.5 MiB |
Length
| Max length | 141 |
|---|---|
| Median length | 123 |
| Mean length | 59.08444638 |
| Min length | 5 |
Unique
| Unique | 743 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Animalia, Chordata, Vertebrata, Mammalia, Eutheria, Laurasiatheria, Artiodactyla, Ruminatia, Bovidae |
|---|---|
| 2nd row | Animalia, Mollusca, Cephalopoda, Ammonoidea |
| 3rd row | Chromista, Foraminifera, Globothalamea, Rotaliida, Discocyclinidae |
| 4th row | Animalia, Chordata, Vertebrata, Pisces, Chondrichthyes, Elasmobranchii, Galeomorphii, Lamniformes, Odontaspididae |
| 5th row | Animalia, Brachiopoda, Rhynchonellata, Orthida, Enteletidae |
| Value | Count | Frequency (%) |
| animalia | 448323 | 15.7% |
| chordata | 148700 | 5.2% |
| vertebrata | 148618 | 5.2% |
| arthropoda | 100318 | 3.5% |
| mollusca | 69025 | 2.4% |
| brachiopoda | 66748 | 2.3% |
| foraminifera | 66301 | 2.3% |
| chromista | 65999 | 2.3% |
| mammalia | 60027 | 2.1% |
| eutheria | 57586 | 2.0% |
| Other values (3834) | 1620986 | 56.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4706865 | 14.4% |
| i | 3184420 | 9.8% |
| (space) | 2300766 | 7.1% |
| , | 2260526 | 6.9% |
| o | 2052009 | 6.3% |
| r | 2005114 | 6.1% |
| e | 1809015 | 5.5% |
| t | 1671086 | 5.1% |
| l | 1501858 | 4.6% |
| n | 1400746 | 4.3% |
| Other values (51) | 9714233 | 29.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25197474 | 77.3% |
| Uppercase Letter | 2811914 | 8.6% |
| Space Separator | 2300766 | 7.1% |
| Other Punctuation | 2295928 | 7.0% |
| Decimal Number | 471 | < 0.1% |
| Open Punctuation | 42 | < 0.1% |
| Close Punctuation | 42 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4706865 | 18.7% |
| i | 3184420 | 12.6% |
| o | 2052009 | 8.1% |
| r | 2005114 | 8.0% |
| e | 1809015 | 7.2% |
| t | 1671086 | 6.6% |
| l | 1501858 | 6.0% |
| n | 1400746 | 5.6% |
| d | 1257138 | 5.0% |
| m | 1113235 | 4.4% |
| Other values (16) | 4495988 | 17.8% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 662527 | 23.6% |
| C | 427513 | 15.2% |
| P | 199516 | 7.1% |
| M | 161377 | 5.7% |
| V | 161299 | 5.7% |
| S | 144831 | 5.2% |
| E | 143204 | 5.1% |
| R | 141162 | 5.0% |
| B | 123534 | 4.4% |
| G | 116236 | 4.1% |
| Other values (16) | 530715 | 18.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2260526 | 98.5% |
| . | 35391 | 1.5% |
| " | 8 | < 0.1% |
| ? | 3 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2300766 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 471 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 42 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 42 | 100.0% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 28009388 | 85.9% |
| Common | 4597250 | 14.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4706865 | 16.8% |
| i | 3184420 | 11.4% |
| o | 2052009 | 7.3% |
| r | 2005114 | 7.2% |
| e | 1809015 | 6.5% |
| t | 1671086 | 6.0% |
| l | 1501858 | 5.4% |
| n | 1400746 | 5.0% |
| d | 1257138 | 4.5% |
| m | 1113235 | 4.0% |
| Other values (42) | 7307902 | 26.1% |
Common
| Value | Count | Frequency (%) |
| (space) | 2300766 | 50.0% |
| , | 2260526 | 49.2% |
| . | 35391 | 0.8% |
| 0 | 471 | < 0.1% |
| ( | 42 | < 0.1% |
| ) | 42 | < 0.1% |
| " | 8 | < 0.1% |
| ? | 3 | < 0.1% |
| - | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 32606638 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4706865 | 14.4% |
| i | 3184420 | 9.8% |
| (space) | 2300766 | 7.1% |
| , | 2260526 | 6.9% |
| o | 2052009 | 6.3% |
| r | 2005114 | 6.1% |
| e | 1809015 | 5.5% |
| t | 1671086 | 5.1% |
| l | 1501858 | 4.6% |
| n | 1400746 | 4.3% |
| Other values (51) | 9714233 | 29.8% |
kingdom
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 172847 |
| Missing (%) | 23.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 8 |
| Mean length | 8.052434375 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Animalia |
|---|---|
| 2nd row | Animalia |
| 3rd row | Chromista |
| 4th row | Animalia |
| 5th row | Animalia |
| Value | Count | Frequency (%) |
| animalia | 448322 | 81.3% |
| chromista | 65985 | 12.0% |
| plantae | 37205 | 6.7% |
| protoctista | 66 | < 0.1% |
| protozoa | 44 | < 0.1% |
| biota | 28 | < 0.1% |
| incertae | 5 | < 0.1% |
| sedis | 5 | < 0.1% |
| bacteria | 5 | < 0.1% |
| arthropoda | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1037193 | 23.3% |
| i | 962733 | 21.7% |
| m | 514307 | 11.6% |
| n | 485532 | 10.9% |
| l | 485527 | 10.9% |
| A | 448323 | 10.1% |
| t | 103471 | 2.3% |
| o | 66279 | 1.5% |
| r | 66107 | 1.5% |
| s | 66061 | 1.5% |
| Other values (11) | 206681 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3890548 | 87.6% |
| Uppercase Letter | 551661 | 12.4% |
| Space Separator | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1037193 | 26.7% |
| i | 962733 | 24.7% |
| m | 514307 | 13.2% |
| n | 485532 | 12.5% |
| l | 485527 | 12.5% |
| t | 103471 | 2.7% |
| o | 66279 | 1.7% |
| r | 66107 | 1.7% |
| s | 66061 | 1.7% |
| h | 65986 | 1.7% |
| Other values (5) | 37352 | 1.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 448323 | 81.3% |
| C | 65985 | 12.0% |
| P | 37315 | 6.8% |
| B | 33 | < 0.1% |
| I | 5 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 5 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4442209 | > 99.9% |
| Common | 5 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1037193 | 23.3% |
| i | 962733 | 21.7% |
| m | 514307 | 11.6% |
| n | 485532 | 10.9% |
| l | 485527 | 10.9% |
| A | 448323 | 10.1% |
| t | 103471 | 2.3% |
| o | 66279 | 1.5% |
| r | 66107 | 1.5% |
| s | 66061 | 1.5% |
| Other values (10) | 206676 | 4.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 5 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4442214 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1037193 | 23.3% |
| i | 962733 | 21.7% |
| m | 514307 | 11.6% |
| n | 485532 | 10.9% |
| l | 485527 | 10.9% |
| A | 448323 | 10.1% |
| t | 103471 | 2.3% |
| o | 66279 | 1.5% |
| r | 66107 | 1.5% |
| s | 66061 | 1.5% |
| Other values (11) | 206681 | 4.7% |
phylum
Text
Missing 
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 211856 |
| Missing (%) | 29.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 14 |
| Mean length | 9.567853047 |
| Min length | 5 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Chordata |
|---|---|
| 2nd row | Mollusca |
| 3rd row | Foraminifera |
| 4th row | Chordata |
| 5th row | Brachiopoda |
| Value | Count | Frequency (%) |
| chordata | 148700 | 29.0% |
| arthropoda | 100304 | 19.5% |
| mollusca | 69025 | 13.4% |
| brachiopoda | 66748 | 13.0% |
| foraminifera | 65986 | 12.9% |
| echinodermata | 26599 | 5.2% |
| bryozoa | 12874 | 2.5% |
| cnidaria | 7243 | 1.4% |
| protozoa | 4080 | 0.8% |
| porifera | 2897 | 0.6% |
| Other values (27) | 8947 | 1.7% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 832296 | 17.0% |
| o | 688644 | 14.0% |
| r | 609931 | 12.4% |
| d | 357317 | 7.3% |
| h | 344816 | 7.0% |
| t | 283255 | 5.8% |
| i | 252208 | 5.1% |
| p | 168801 | 3.4% |
| c | 165860 | 3.4% |
| C | 156159 | 3.2% |
| Other values (24) | 1045692 | 21.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4391575 | 89.5% |
| Uppercase Letter | 512652 | 10.5% |
| Space Separator | 751 | < 0.1% |
| Other Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 832296 | 19.0% |
| o | 688644 | 15.7% |
| r | 609931 | 13.9% |
| d | 357317 | 8.1% |
| h | 344816 | 7.9% |
| t | 283255 | 6.4% |
| i | 252208 | 5.7% |
| p | 168801 | 3.8% |
| c | 165860 | 3.8% |
| l | 143009 | 3.3% |
| Other values (10) | 545438 | 12.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 156159 | 30.5% |
| A | 103373 | 20.2% |
| B | 79622 | 15.5% |
| M | 69031 | 13.5% |
| F | 65986 | 12.9% |
| E | 26614 | 5.2% |
| P | 8593 | 1.7% |
| H | 2435 | 0.5% |
| I | 754 | 0.1% |
| G | 66 | < 0.1% |
| Other values (2) | 19 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 751 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4904227 | > 99.9% |
| Common | 752 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 832296 | 17.0% |
| o | 688644 | 14.0% |
| r | 609931 | 12.4% |
| d | 357317 | 7.3% |
| h | 344816 | 7.0% |
| t | 283255 | 5.8% |
| i | 252208 | 5.1% |
| p | 168801 | 3.4% |
| c | 165860 | 3.4% |
| C | 156159 | 3.2% |
| Other values (22) | 1044940 | 21.3% |
Common
| Value | Count | Frequency (%) |
| (space) | 751 | 99.9% |
| . | 1 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4904979 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 832296 | 17.0% |
| o | 688644 | 14.0% |
| r | 609931 | 12.4% |
| d | 357317 | 7.3% |
| h | 344816 | 7.0% |
| t | 283255 | 5.8% |
| i | 252208 | 5.1% |
| p | 168801 | 3.4% |
| c | 165860 | 3.4% |
| C | 156159 | 3.2% |
| Other values (24) | 1045692 | 21.3% |
class
Text
Missing 
| Distinct | 145 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 235611 |
| Missing (%) | 32.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 19 |
| Mean length | 9.967651673 |
| Min length | 4 |
Unique
| Unique | 7 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Mammalia |
|---|---|
| 2nd row | Cephalopoda |
| 3rd row | Globothalamea |
| 4th row | Chondrichthyes |
| 5th row | Rhynchonellata |
| Value | Count | Frequency (%) |
| mammalia | 60027 | 12.2% |
| globothalamea | 41779 | 8.5% |
| rhynchonellata | 39023 | 7.9% |
| aves | 34583 | 7.0% |
| insecta | 29284 | 6.0% |
| chondrichthyes | 26607 | 5.4% |
| gastropoda | 24466 | 5.0% |
| ostracoda | 24047 | 4.9% |
| trilobita | 22871 | 4.7% |
| bivalvia | 22291 | 4.5% |
| Other values (133) | 165921 | 33.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 859113 | 17.6% |
| o | 453975 | 9.3% |
| t | 367169 | 7.5% |
| l | 337501 | 6.9% |
| i | 301652 | 6.2% |
| e | 293993 | 6.0% |
| h | 287732 | 5.9% |
| n | 212707 | 4.4% |
| s | 207229 | 4.3% |
| m | 199854 | 4.1% |
| Other values (39) | 1352230 | 27.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4382031 | 89.9% |
| Uppercase Letter | 488897 | 10.0% |
| Space Separator | 2002 | < 0.1% |
| Other Punctuation | 179 | < 0.1% |
| Open Punctuation | 23 | < 0.1% |
| Close Punctuation | 23 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 859113 | 19.6% |
| o | 453975 | 10.4% |
| t | 367169 | 8.4% |
| l | 337501 | 7.7% |
| i | 301652 | 6.9% |
| e | 293993 | 6.7% |
| h | 287732 | 6.6% |
| n | 212707 | 4.9% |
| s | 207229 | 4.7% |
| m | 199854 | 4.6% |
| Other values (14) | 861106 | 19.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 78641 | 16.1% |
| G | 68696 | 14.1% |
| M | 63598 | 13.0% |
| R | 49366 | 10.1% |
| A | 43821 | 9.0% |
| O | 34647 | 7.1% |
| T | 31644 | 6.5% |
| I | 29906 | 6.1% |
| B | 26753 | 5.5% |
| S | 20027 | 4.1% |
| Other values (11) | 41798 | 8.5% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 2002 | 100.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 179 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 23 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 23 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4870928 | > 99.9% |
| Common | 2227 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 859113 | 17.6% |
| o | 453975 | 9.3% |
| t | 367169 | 7.5% |
| l | 337501 | 6.9% |
| i | 301652 | 6.2% |
| e | 293993 | 6.0% |
| h | 287732 | 5.9% |
| n | 212707 | 4.4% |
| s | 207229 | 4.3% |
| m | 199854 | 4.1% |
| Other values (35) | 1350003 | 27.7% |
Common
| Value | Count | Frequency (%) |
| (space) | 2002 | 89.9% |
| . | 179 | 8.0% |
| ( | 23 | 1.0% |
| ) | 23 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4873155 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 859113 | 17.6% |
| o | 453975 | 9.3% |
| t | 367169 | 7.5% |
| l | 337501 | 6.9% |
| i | 301652 | 6.2% |
| e | 293993 | 6.0% |
| h | 287732 | 5.9% |
| n | 212707 | 4.4% |
| s | 207229 | 4.3% |
| m | 199854 | 4.1% |
| Other values (39) | 1352230 | 27.7% |
order
Text
Missing 
| Distinct | 552 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 400004 |
| Missing (%) | 55.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 28 |
|---|---|
| Median length | 22 |
| Mean length | 11.13181656 |
| Min length | 1 |
Unique
| Unique | 66 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Artiodactyla |
|---|---|
| 2nd row | Ammonoidea |
| 3rd row | Rotaliida |
| 4th row | Lamniformes |
| 5th row | Orthida |
| Value | Count | Frequency (%) |
| rotaliida | 32318 | 9.7% |
| lamniformes | 12411 | 3.7% |
| spiriferida | 11138 | 3.3% |
| cetacea | 10502 | 3.1% |
| productida | 10020 | 3.0% |
| procellariiformes | 9895 | 3.0% |
| ammonoidea | 9257 | 2.8% |
| order | 9090 | 2.7% |
| artiodactyla | 8886 | 2.7% |
| terebratulida | 8672 | 2.6% |
| Other values (536) | 212022 | 63.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 454969 | 12.6% |
| a | 442612 | 12.3% |
| r | 320973 | 8.9% |
| o | 301998 | 8.4% |
| e | 264934 | 7.3% |
| d | 249362 | 6.9% |
| t | 203578 | 5.6% |
| l | 161146 | 4.5% |
| s | 140573 | 3.9% |
| n | 136028 | 3.8% |
| Other values (44) | 936146 | 25.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3269472 | 90.5% |
| Uppercase Letter | 324156 | 9.0% |
| Space Separator | 9707 | 0.3% |
| Other Punctuation | 8600 | 0.2% |
| Decimal Number | 348 | < 0.1% |
| Open Punctuation | 18 | < 0.1% |
| Close Punctuation | 18 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 454969 | 13.9% |
| a | 442612 | 13.5% |
| r | 320973 | 9.8% |
| o | 301998 | 9.2% |
| e | 264934 | 8.1% |
| d | 249362 | 7.6% |
| t | 203578 | 6.2% |
| l | 161146 | 4.9% |
| s | 140573 | 4.3% |
| n | 136028 | 4.2% |
| Other values (16) | 593299 | 18.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 57757 | 17.8% |
| R | 46961 | 14.5% |
| C | 44552 | 13.7% |
| A | 31461 | 9.7% |
| S | 29800 | 9.2% |
| L | 25584 | 7.9% |
| O | 20494 | 6.3% |
| T | 18568 | 5.7% |
| M | 9915 | 3.1% |
| D | 8635 | 2.7% |
| Other values (12) | 30429 | 9.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8599 | > 99.9% |
| , | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 9707 | 100.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 348 | 100.0% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 18 | 100.0% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 18 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3593628 | 99.5% |
| Common | 18691 | 0.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 454969 | 12.7% |
| a | 442612 | 12.3% |
| r | 320973 | 8.9% |
| o | 301998 | 8.4% |
| e | 264934 | 7.4% |
| d | 249362 | 6.9% |
| t | 203578 | 5.7% |
| l | 161146 | 4.5% |
| s | 140573 | 3.9% |
| n | 136028 | 3.8% |
| Other values (38) | 917455 | 25.5% |
Common
| Value | Count | Frequency (%) |
| (space) | 9707 | 51.9% |
| . | 8599 | 46.0% |
| 0 | 348 | 1.9% |
| ( | 18 | 0.1% |
| ) | 18 | 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3612319 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 454969 | 12.6% |
| a | 442612 | 12.3% |
| r | 320973 | 8.9% |
| o | 301998 | 8.4% |
| e | 264934 | 7.3% |
| d | 249362 | 6.9% |
| t | 203578 | 5.6% |
| l | 161146 | 4.5% |
| s | 140573 | 3.9% |
| n | 136028 | 3.8% |
| Other values (44) | 936146 | 25.9% |
family
Text
Missing 
| Distinct | 2441 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 409455 |
| Missing (%) | 56.5% |
| Memory size | 5.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 23 |
| Mean length | 12.35823496 |
| Min length | 1 |
Unique
| Unique | 406 |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Bovidae |
|---|---|
| 2nd row | Discocyclinidae |
| 3rd row | Odontaspididae |
| 4th row | Enteletidae |
| 5th row | Procellariidae |
| Value | Count | Frequency (%) |
| family | 24920 | 7.3% |
| indet | 24361 | 7.2% |
| procellariidae | 9409 | 2.8% |
| carcharhinidae | 6802 | 2.0% |
| lamnidae | 6398 | 1.9% |
| anatidae | 5246 | 1.5% |
| equidae | 4518 | 1.3% |
| phocidae | 4479 | 1.3% |
| odontaspididae | 3901 | 1.1% |
| vaginulinidae | 3658 | 1.1% |
| Other values (2428) | 246880 | 72.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 562017 | 14.4% |
| e | 500496 | 12.9% |
| a | 474982 | 12.2% |
| d | 376670 | 9.7% |
| o | 212006 | 5.4% |
| l | 211977 | 5.4% |
| r | 188973 | 4.9% |
| n | 186459 | 4.8% |
| t | 179603 | 4.6% |
| c | 107527 | 2.8% |
| Other values (50) | 892789 | 22.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3528570 | 90.6% |
| Uppercase Letter | 314926 | 8.1% |
| Space Separator | 25519 | 0.7% |
| Other Punctuation | 24358 | 0.6% |
| Decimal Number | 123 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
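The category names in this table correspond to Unicode general categories. A minimal sketch of how the breakdown could be reproduced with Python's standard `unicodedata` module; the mapping covers only the categories that appear in this report, and the sample values are hypothetical:

```python
import unicodedata
from collections import Counter

# Unicode general-category codes mapped to the labels used in the report.
CATEGORY_NAMES = {
    "Ll": "Lowercase Letter",
    "Lu": "Uppercase Letter",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Nd": "Decimal Number",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
    "Pd": "Dash Punctuation",
    "Sm": "Math Symbol",
}


def category_counts(values):
    """Count characters per Unicode general category."""
    counts = Counter()
    for value in values:
        for ch in value:
            code = unicodedata.category(ch)
            # Fall back to the raw code for categories not listed above.
            counts[CATEGORY_NAMES.get(code, code)] += 1
    return counts


# Hypothetical sample values, not rows from the dataset.
sample = ["Family indet.", "Bovidae"]
cats = category_counts(sample)

for name, count in cats.most_common():
    print(f"{name}\t{count}")
```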
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 562017 | |
| e | 500496 | |
| a | 474982 | |
| d | 376670 | |
| o | 212006 | 6.0% |
| l | 211977 | 6.0% |
| r | 188973 | 5.4% |
| n | 186459 | 5.3% |
| t | 179603 | 5.1% |
| c | 107527 | 3.0% |
| Other values (16) | 527860 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 45247 | |
| C | 31894 | 10.1% |
| F | 28184 | 8.9% |
| S | 21920 | 7.0% |
| A | 21504 | 6.8% |
| L | 19683 | 6.3% |
| E | 17192 | 5.5% |
| T | 16574 | 5.3% |
| O | 15064 | 4.8% |
| H | 14161 | 4.5% |
| Other values (16) | 83503 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 24354 | |
| ? | 3 | < 0.1% |
| , | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 25519 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 123 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3843496 | |
| Common | 50003 | 1.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 562017 | |
| e | 500496 | |
| a | 474982 | |
| d | 376670 | |
| o | 212006 | 5.5% |
| l | 211977 | 5.5% |
| r | 188973 | 4.9% |
| n | 186459 | 4.9% |
| t | 179603 | 4.7% |
| c | 107527 | 2.8% |
| Other values (42) | 842786 |
Common
| Value | Count | Frequency (%) |
| (space) | 25519 | |
| . | 24354 | |
| 0 | 123 | 0.2% |
| ? | 3 | < 0.1% |
| ( | 1 | < 0.1% |
| ) | 1 | < 0.1% |
| - | 1 | < 0.1% |
| , | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3893499 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 562017 | |
| e | 500496 | |
| a | 474982 | |
| d | 376670 | |
| o | 212006 | 5.4% |
| l | 211977 | 5.4% |
| r | 188973 | 4.9% |
| n | 186459 | 4.8% |
| t | 179603 | 4.6% |
| c | 107527 | 2.8% |
| Other values (50) | 892789 |
genus
Text
Missing 
| Distinct | 20259 |
|---|---|
| Distinct (%) | 3.8% |
| Missing | 197061 |
| Missing (%) | 27.2% |
| Memory size | 5.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 23 |
| Mean length | 9.623302436 |
| Min length | 1 |
Unique
| Unique | 5010 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | Damaliscus |
|---|---|
| 2nd row | Acrochordiceras |
| 3rd row | Discocyclina |
| 4th row | Odontaspis |
| 5th row | Enteletes |
| Value | Count | Frequency (%) |
| genus | 56245 | 10.6% |
| marrella | 12281 | 2.3% |
| pterodroma | 7305 | 1.4% |
| callophoca | 3770 | 0.7% |
| isurus | 3463 | 0.7% |
| physeterula | 3029 | 0.6% |
| carcharhinus | 2930 | 0.6% |
| australca | 2250 | 0.4% |
| thambetochen | 2208 | 0.4% |
| hustedia | 2082 | 0.4% |
| Other values (20248) | 432660 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 526234 | 10.4% |
| e | 421801 | 8.3% |
| i | 409475 | 8.1% |
| o | 392073 | 7.7% |
| s | 365990 | 7.2% |
| r | 360745 | 7.1% |
| l | 312289 | 6.2% |
| n | 296798 | 5.8% |
| u | 263865 | 5.2% |
| t | 240334 | 4.7% |
| Other values (48) | 1486178 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4547110 | |
| Uppercase Letter | 527447 | 10.4% |
| Space Separator | 776 | < 0.1% |
| Other Punctuation | 437 | < 0.1% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
| Decimal Number | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 526234 | |
| e | 421801 | |
| i | 409475 | |
| o | 392073 | |
| s | 365990 | 8.0% |
| r | 360745 | 7.9% |
| l | 312289 | 6.9% |
| n | 296798 | 6.5% |
| u | 263865 | 5.8% |
| t | 240334 | 5.3% |
| Other values (16) | 957506 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 76336 | |
| P | 65817 | |
| C | 58147 | |
| M | 38384 | 7.3% |
| A | 38047 | 7.2% |
| S | 34182 | 6.5% |
| H | 25893 | 4.9% |
| T | 25366 | 4.8% |
| B | 24502 | 4.6% |
| L | 22782 | 4.3% |
| Other values (16) | 117991 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 431 | |
| ? | 6 | 1.4% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 776 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5074557 | |
| Common | 1225 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 526234 | 10.4% |
| e | 421801 | 8.3% |
| i | 409475 | 8.1% |
| o | 392073 | 7.7% |
| s | 365990 | 7.2% |
| r | 360745 | 7.1% |
| l | 312289 | 6.2% |
| n | 296798 | 5.8% |
| u | 263865 | 5.2% |
| t | 240334 | 4.7% |
| Other values (42) | 1484953 |
Common
| Value | Count | Frequency (%) |
| (space) | 776 | |
| . | 431 | |
| ? | 6 | 0.5% |
| ( | 5 | 0.4% |
| ) | 5 | 0.4% |
| 0 | 2 | 0.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5075782 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 526234 | 10.4% |
| e | 421801 | 8.3% |
| i | 409475 | 8.1% |
| o | 392073 | 7.7% |
| s | 365990 | 7.2% |
| r | 360745 | 7.1% |
| l | 312289 | 6.2% |
| n | 296798 | 5.8% |
| u | 263865 | 5.2% |
| t | 240334 | 4.7% |
| Other values (48) | 1486178 |
subgenus
Text
Missing 
| Distinct | 2470 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 702202 |
| Missing (%) | 96.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 10.61570878 |
| Min length | 3 |
Unique
| Unique | 735 |
|---|---|
| Unique (%) | 3.3% |
Sample
| 1st row | Asterocyclina |
|---|---|
| 2nd row | Radiatrypa |
| 3rd row | Laevidentalium |
| 4th row | Vacoea |
| 5th row | Phyllonotus |
| Value | Count | Frequency (%) |
| nephrolepidina | 547 | 2.5% |
| lingulella | 440 | 2.0% |
| lingulepis | 430 | 1.9% |
| lepidocyclina | 379 | 1.7% |
| dyoros | 329 | 1.5% |
| eulepidina | 285 | 1.3% |
| discocyclina | 264 | 1.2% |
| vacoea | 243 | 1.1% |
| chlamys | 239 | 1.1% |
| proporocyclina | 214 | 1.0% |
| Other values (2461) | 18944 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 25775 | 10.9% |
| i | 22604 | 9.5% |
| o | 18830 | 8.0% |
| e | 18657 | 7.9% |
| r | 16116 | 6.8% |
| l | 16112 | 6.8% |
| s | 14304 | 6.0% |
| c | 11983 | 5.1% |
| t | 11285 | 4.8% |
| n | 11277 | 4.8% |
| Other values (48) | 69851 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 214453 | |
| Uppercase Letter | 22303 | 9.4% |
| Close Punctuation | 15 | < 0.1% |
| Space Separator | 8 | < 0.1% |
| Dash Punctuation | 6 | < 0.1% |
| Other Punctuation | 6 | < 0.1% |
| Open Punctuation | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 25775 | |
| i | 22604 | |
| o | 18830 | 8.8% |
| e | 18657 | 8.7% |
| r | 16116 | 7.5% |
| l | 16112 | 7.5% |
| s | 14304 | 6.7% |
| c | 11983 | 5.6% |
| t | 11285 | 5.3% |
| n | 11277 | 5.3% |
| Other values (16) | 47510 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 3361 | |
| L | 2407 | |
| C | 1994 | |
| A | 1878 | 8.4% |
| M | 1420 | 6.4% |
| S | 1416 | 6.3% |
| N | 1385 | 6.2% |
| T | 1191 | 5.3% |
| D | 1188 | 5.3% |
| E | 1176 | 5.3% |
| Other values (16) | 4887 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 5 | |
| ? | 1 | 16.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 15 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 8 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 236756 | |
| Common | 38 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 25775 | 10.9% |
| i | 22604 | 9.5% |
| o | 18830 | 8.0% |
| e | 18657 | 7.9% |
| r | 16116 | 6.8% |
| l | 16112 | 6.8% |
| s | 14304 | 6.0% |
| c | 11983 | 5.1% |
| t | 11285 | 4.8% |
| n | 11277 | 4.8% |
| Other values (42) | 69813 |
Common
| Value | Count | Frequency (%) |
| ) | 15 | |
| (space) | 8 | |
| - | 6 | 15.8% |
| . | 5 | 13.2% |
| ( | 3 | 7.9% |
| ? | 1 | 2.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 236794 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 25775 | 10.9% |
| i | 22604 | 9.5% |
| o | 18830 | 8.0% |
| e | 18657 | 7.9% |
| r | 16116 | 6.8% |
| l | 16112 | 6.8% |
| s | 14304 | 6.0% |
| c | 11983 | 5.1% |
| t | 11285 | 4.8% |
| n | 11277 | 4.8% |
| Other values (48) | 69851 |
specificEpithet
Text
Missing 
| Distinct | 32184 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 197674 |
| Missing (%) | 27.3% |
| Memory size | 5.5 MiB |
Length
| Max length | 31 |
|---|---|
| Median length | 21 |
| Mean length | 7.031748141 |
| Min length | 1 |
Unique
| Unique | 10223 |
|---|---|
| Unique (%) | 1.9% |
Sample
| 1st row | lunatus |
|---|---|
| 2nd row | hyatti |
| 3rd row | sculpturata |
| 4th row | cuspidata |
| 5th row | rotundobesus |
| Value | Count | Frequency (%) |
| sp | 136976 | 25.7% |
| splendens | 12400 | 2.3% |
| phaeopygia | 3232 | 0.6% |
| species | 2814 | 0.5% |
| a | 2244 | 0.4% |
| bella | 2150 | 0.4% |
| alba | 2016 | 0.4% |
| megalodon | 1645 | 0.3% |
| confluens | 1466 | 0.3% |
| obscura | 1275 | 0.2% |
| Other values (32112) | 367401 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 492867 | |
| a | 409545 | |
| i | 366458 | |
| e | 293309 | 7.9% |
| n | 257096 | 6.9% |
| p | 241847 | 6.5% |
| r | 211113 | 5.7% |
| l | 197470 | 5.3% |
| u | 185989 | 5.0% |
| o | 183662 | 5.0% |
| Other values (34) | 865208 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3692426 | |
| Space Separator | 6785 | 0.2% |
| Other Punctuation | 3066 | 0.1% |
| Decimal Number | 1873 | 0.1% |
| Dash Punctuation | 413 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 492867 | |
| a | 409545 | |
| i | 366458 | |
| e | 293309 | 7.9% |
| n | 257096 | 7.0% |
| p | 241847 | 6.5% |
| r | 211113 | 5.7% |
| l | 197470 | 5.3% |
| u | 185989 | 5.0% |
| o | 183662 | 5.0% |
| Other values (16) | 853070 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 923 | |
| 2 | 534 | |
| 3 | 195 | 10.4% |
| 4 | 89 | 4.8% |
| 5 | 66 | 3.5% |
| 6 | 38 | 2.0% |
| 7 | 19 | 1.0% |
| 8 | 5 | 0.3% |
| 9 | 2 | 0.1% |
| 0 | 2 | 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 3033 | |
| ' | 21 | 0.7% |
| ? | 6 | 0.2% |
| * | 5 | 0.2% |
| # | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 6785 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 413 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3692426 | |
| Common | 12138 | 0.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 492867 | |
| a | 409545 | |
| i | 366458 | |
| e | 293309 | 7.9% |
| n | 257096 | 7.0% |
| p | 241847 | 6.5% |
| r | 211113 | 5.7% |
| l | 197470 | 5.3% |
| u | 185989 | 5.0% |
| o | 183662 | 5.0% |
| Other values (16) | 853070 |
Common
| Value | Count | Frequency (%) |
| (space) | 6785 | |
| . | 3033 | |
| 1 | 923 | 7.6% |
| 2 | 534 | 4.4% |
| - | 413 | 3.4% |
| 3 | 195 | 1.6% |
| 4 | 89 | 0.7% |
| 5 | 66 | 0.5% |
| 6 | 38 | 0.3% |
| ' | 21 | 0.2% |
| Other values (8) | 41 | 0.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3704564 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 492867 | |
| a | 409545 | |
| i | 366458 | |
| e | 293309 | 7.9% |
| n | 257096 | 6.9% |
| p | 241847 | 6.5% |
| r | 211113 | 5.7% |
| l | 197470 | 5.3% |
| u | 185989 | 5.0% |
| o | 183662 | 5.0% |
| Other values (34) | 865208 |
infraspecificEpithet
Text
Missing 
| Distinct | 3295 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 708037 |
| Missing (%) | 97.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 8.558557465 |
| Min length | 1 |
Unique
| Unique | 1244 |
|---|---|
| Unique (%) | 7.6% |
Sample
| 1st row | amplexoides |
|---|---|
| 2nd row | grandis |
| 3rd row | canalis |
| 4th row | cooperensis |
| 5th row | pyramidale |
| Value | Count | Frequency (%) |
| burchelli | 494 | 3.0% |
| halli | 243 | 1.5% |
| a | 159 | 1.0% |
| pugilla | 151 | 0.9% |
| spinifera | 136 | 0.8% |
| b | 135 | 0.8% |
| antarctica | 104 | 0.6% |
| bellaplicata | 81 | 0.5% |
| nasiterna | 79 | 0.5% |
| minor | 78 | 0.5% |
| Other values (3272) | 14872 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 18791 | |
| i | 14907 | |
| s | 13226 | |
| e | 11648 | 8.3% |
| n | 10012 | 7.1% |
| t | 8967 | 6.4% |
| r | 8880 | 6.3% |
| l | 8863 | 6.3% |
| u | 7809 | 5.5% |
| o | 7067 | 5.0% |
| Other values (37) | 30798 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 140678 | |
| Dash Punctuation | 99 | 0.1% |
| Decimal Number | 63 | < 0.1% |
| Space Separator | 61 | < 0.1% |
| Uppercase Letter | 43 | < 0.1% |
| Other Punctuation | 24 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 18791 | |
| i | 14907 | |
| s | 13226 | |
| e | 11648 | 8.3% |
| n | 10012 | 7.1% |
| t | 8967 | 6.4% |
| r | 8880 | 6.3% |
| l | 8863 | 6.3% |
| u | 7809 | 5.6% |
| o | 7067 | 5.0% |
| Other values (16) | 30508 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 15 | |
| B | 7 | |
| D | 5 | 11.6% |
| L | 4 | 9.3% |
| F | 3 | 7.0% |
| I | 2 | 4.7% |
| G | 1 | 2.3% |
| S | 1 | 2.3% |
| E | 1 | 2.3% |
| A | 1 | 2.3% |
| Other values (3) | 3 | 7.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 39 | |
| 3 | 11 | 17.5% |
| 2 | 9 | 14.3% |
| 4 | 3 | 4.8% |
| 5 | 1 | 1.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 99 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 61 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 24 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 140721 | |
| Common | 247 | 0.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 18791 | |
| i | 14907 | |
| s | 13226 | |
| e | 11648 | 8.3% |
| n | 10012 | 7.1% |
| t | 8967 | 6.4% |
| r | 8880 | 6.3% |
| l | 8863 | 6.3% |
| u | 7809 | 5.5% |
| o | 7067 | 5.0% |
| Other values (29) | 30551 |
Common
| Value | Count | Frequency (%) |
| - | 99 | |
| (space) | 61 | |
| 1 | 39 | 15.8% |
| . | 24 | 9.7% |
| 3 | 11 | 4.5% |
| 2 | 9 | 3.6% |
| 4 | 3 | 1.2% |
| 5 | 1 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 140968 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 18791 | |
| i | 14907 | |
| s | 13226 | |
| e | 11648 | 8.3% |
| n | 10012 | 7.1% |
| t | 8967 | 6.4% |
| r | 8880 | 6.3% |
| l | 8863 | 6.3% |
| u | 7809 | 5.5% |
| o | 7067 | 5.0% |
| Other values (37) | 30798 |
taxonRank
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 707802 |
| Missing (%) | 97.7% |
| Memory size | 5.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 8.738058183 |
| Min length | 5 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | subspecies |
|---|---|
| 2nd row | variety |
| 3rd row | subspecies |
| 4th row | variety |
| 5th row | subspecies |
| Value | Count | Frequency (%) |
| subspecies | 9791 | |
| variety | 6728 | |
| forma | 134 | 0.8% |
| morpha | 37 | 0.2% |
| clade | 16 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 29373 | |
| e | 26326 | |
| i | 16519 | |
| p | 9828 | 6.7% |
| b | 9791 | 6.7% |
| c | 9791 | 6.7% |
| u | 9791 | 6.7% |
| a | 6915 | 4.7% |
| r | 6899 | 4.7% |
| v | 6728 | 4.6% |
| Other values (9) | 14017 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 145962 | |
| Uppercase Letter | 16 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 29373 | |
| e | 26326 | |
| i | 16519 | |
| p | 9828 | 6.7% |
| b | 9791 | 6.7% |
| c | 9791 | 6.7% |
| u | 9791 | 6.7% |
| a | 6915 | 4.7% |
| r | 6899 | 4.7% |
| v | 6728 | 4.6% |
| Other values (8) | 14001 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 16 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 145978 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| s | 29373 | |
| e | 26326 | |
| i | 16519 | |
| p | 9828 | 6.7% |
| b | 9791 | 6.7% |
| c | 9791 | 6.7% |
| u | 9791 | 6.7% |
| a | 6915 | 4.7% |
| r | 6899 | 4.7% |
| v | 6728 | 4.6% |
| Other values (9) | 14017 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 145978 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 29373 | |
| e | 26326 | |
| i | 16519 | |
| p | 9828 | 6.7% |
| b | 9791 | 6.7% |
| c | 9791 | 6.7% |
| u | 9791 | 6.7% |
| a | 6915 | 4.7% |
| r | 6899 | 4.7% |
| v | 6728 | 4.6% |
| Other values (9) | 14017 |
scientificNameAuthorship
Text
Missing 
| Distinct | 7319 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 325030 |
| Missing (%) | 44.9% |
| Memory size | 5.5 MiB |
Length
| Max length | 103 |
|---|---|
| Median length | 51 |
| Mean length | 9.144288296 |
| Min length | 2 |
Unique
| Unique | 1579 |
|---|---|
| Unique (%) | 0.4% |
Sample
| 1st row | Meek |
|---|---|
| 2nd row | (Cushman) |
| 3rd row | (Agassiz) |
| 4th row | Cooper & Grant |
| 5th row | Cuvier |
| Value | Count | Frequency (%) |
|  | 77310 | 13.1% |
| walcott | 26311 | 4.5% |
| cooper | 26282 | 4.4% |
| cushman | 17375 | 2.9% |
| grant | 16892 | 2.9% |
| ulrich | 12249 | 2.1% |
| et | 9463 | 1.6% |
| al | 9463 | 1.6% |
| hall | 8176 | 1.4% |
| bassler | 5943 | 1.0% |
| Other values (4208) | 381568 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 302103 | 8.3% |
| a | 256596 | 7.0% |
| o | 243833 | 6.7% |
| r | 239853 | 6.6% |
| n | 225453 | 6.2% |
| l | 204010 | 5.6% |
| (space) | 191554 | 5.2% |
| t | 170449 | 4.7% |
| i | 153159 | 4.2% |
| s | 150944 | 4.1% |
| Other values (66) | 1514988 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2653589 | |
| Uppercase Letter | 500242 | 13.7% |
| Space Separator | 191554 | 5.2% |
| Open Punctuation | 106388 | 2.9% |
| Close Punctuation | 106388 | 2.9% |
| Other Punctuation | 92459 | 2.5% |
| Dash Punctuation | 2322 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 302103 | |
| a | 256596 | |
| o | 243833 | |
| r | 239853 | 9.0% |
| n | 225453 | 8.5% |
| l | 204010 | 7.7% |
| t | 170449 | 6.4% |
| i | 153159 | 5.8% |
| s | 150944 | 5.7% |
| h | 100899 | 3.8% |
| Other values (31) | 606290 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 79537 | |
| W | 50766 | 10.1% |
| G | 43481 | 8.7% |
| S | 39975 | 8.0% |
| B | 33936 | 6.8% |
| M | 30616 | 6.1% |
| H | 30047 | 6.0% |
| L | 27178 | 5.4% |
| R | 20489 | 4.1% |
| P | 15813 | 3.2% |
| Other values (16) | 128404 |
Other Punctuation
| Value | Count | Frequency (%) |
| & | 77309 | |
| . | 9415 | 10.2% |
| ' | 5434 | 5.9% |
| , | 300 | 0.3% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 191554 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 106388 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 106388 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2322 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3153831 | |
| Common | 499111 | 13.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 302103 | 9.6% |
| a | 256596 | 8.1% |
| o | 243833 | 7.7% |
| r | 239853 | 7.6% |
| n | 225453 | 7.1% |
| l | 204010 | 6.5% |
| t | 170449 | 5.4% |
| i | 153159 | 4.9% |
| s | 150944 | 4.8% |
| h | 100899 | 3.2% |
| Other values (57) | 1106532 |
Common
| Value | Count | Frequency (%) |
| (space) | 191554 | |
| ( | 106388 | |
| ) | 106388 | |
| & | 77309 | |
| . | 9415 | 1.9% |
| ' | 5434 | 1.1% |
| - | 2322 | 0.5% |
| , | 300 | 0.1% |
| ? | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3650564 | |
| None | 2378 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 302103 | 8.3% |
| a | 256596 | 7.0% |
| o | 243833 | 6.7% |
| r | 239853 | 6.6% |
| n | 225453 | 6.2% |
| l | 204010 | 5.6% |
| (space) | 191554 | 5.2% |
| t | 170449 | 4.7% |
| i | 153159 | 4.2% |
| s | 150944 | 4.1% |
| Other values (50) | 1512610 |
None
| Value | Count | Frequency (%) |
| ú | 939 | |
| ö | 833 | |
| ž | 158 | 6.6% |
| å | 99 | 4.2% |
| ë | 95 | 4.0% |
| ä | 74 | 3.1% |
| ü | 64 | 2.7% |
| é | 48 | 2.0% |
| ó | 17 | 0.7% |
| ñ | 16 | 0.7% |
| Other values (6) | 35 | 1.5% |
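The `None` block above collects the characters that fall outside ASCII; the ones listed are all accented letters. A minimal sketch for locating such characters before any normalization or matching step; the sample names are hypothetical, and only the characters `ö` and `ú` are taken from the table:

```python
from collections import Counter


def non_ascii_counts(values):
    """Count every character that is not in the ASCII range."""
    counts = Counter()
    for value in values:
        # str.isascii() is available from Python 3.7 onward.
        counts.update(ch for ch in value if not ch.isascii())
    return counts


# Hypothetical sample values, not rows from the dataset.
sample = ["Möller", "(Cushman)", "Kún & Möbius"]
found = non_ascii_counts(sample)

for ch, count in found.most_common():
    print(f"{ch}\t{count}")
```

Counting rather than stripping keeps the original spellings intact, which matters when author names are later matched against external name authorities.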